Get all your news in one place.
100’s of premium titles.
One app.
Start reading
Tom’s Hardware
Tom’s Hardware
Technology
Aaron Klotz

Stable Diffusion Optimized for Intel Silicon Boosts Arc A770 Performance by 54%

Arc A770 Limited Edition

Automatic1111's Stable Diffusion WebUI now works with Intel GPU hardware, thanks to the integration of Intel's OpenVINO toolkit that takes AI models and optimizes them to run on Intel hardware. We've re-tested the latest release of Stable Diffusion to see how much faster Intel's GPUs are compared to our previous results, with gains of 40 to 55 percent.

Stable Diffusion (that currently has our previous testing, though we're working on updating the results) is a deep-learning AI model used to generate images from text descriptions. What makes Stable Diffusion special is its ability to run on local consumer hardware. The AI community has plenty of projects out there, with Stable Diffusion WebUI being the most popular. It provides a browser interface that's easy to use and experiment with.

After months of work in the background (we've been hearing rumblings of this for a while now), the latest updates are now available for Intel Arc owners and provide a substantial boost to performance.

Here are the results of our previous and updated testing of Stable Diffusion. We used a slightly tweaked Stable Diffusion OpenVINO for our previous testing, and have retested with the fork of Automatic1111 webui with OpenVINO. We also retested several of AMD's GPUs with a newer build of Nod.ai's Shark-based Stable Diffusion. The Nvidia results haven't been updated, though we'll look at retesting with the latest version in the near future (and update the main Stable Diffusion benchmarks article when we're finished).

We should note that we also changed our prompt, which makes the new results generally more demanding. (The new prompt is "messy room," which tends to have a lot of tiny details in the images that require more effort for the AI to generate.) There's variation between runs, and there are caveats that apply specifically to Arc right now, but here are the before/after results.

(Image credit: Future)
(Image credit: Future)

The Intel ARC and AMD GPUs all show improved performance, with most delivering significant gains. The Arc A770 16GB improved by 54%, while the A750 improved by 40% in the same scenario. (Note that we used the Intel Arc A770 Limited Edition card for testing, which is now discontinued, though Acer, ASRock, Sparkle, and Gunnir still offer A770 cards — both 16GB and 8GB variants.)

Nod.ai hasn't been sitting still either. AMD's RX 6800, RX 6750 XT, and RX 6700 10GB are all faster, with the 6800 and 6700 10GB in particular showing large gains. We're not sure why the 6750 XT didn't do as well, but the RX 6800 saw a performance boost of 34% and the RX 6700 10GB saw an even greater 76% performance improvement. The RX 6750 XT for some reason only saw a measly 9% increase, even though all three AMD GPUs share the same RDNA2 architecture. (We'll be retesting other GPUs, including AMD's newer RX 7000-series parts, in the near future.)

Again, we did not retest the three Nvidia RTX 40-series GPUs, which is why the performance statistics remain identical between the two graphs. Even so, with the new OpenVINO optimizations, Intel's Arc A750 and A770 are now able to outperform the RTX 4060, and the A770 16GB is close behind the RTX 4060 Ti.

There's still plenty of ongoing work, including making the installation more straightforward, and fixes so that other image resolutions and Stable Diffusion models work. We had to rely on the "v1-5-pruned-emaonly.safetensors" default model, as the newer "v2-1_512-ema-pruned.safetensors" and "v2-1_768-ema-pruned.safetensors" failed to generate meaningful output. Also, 768x768 generation currently fails on Arc GPUs — we could do up to 720x720, but 744x744 ended up switching to CPU-based generation. We're told a fix for the 768x768 support should be coming relatively soon, though, so Arc users should keep an eye out for that update.

Update, 8/17/2023: The fix is live. To get 768x768 working, go the the directory where you've installed Stable Diffusion OpenVINO, and run: "venv\Scripts\activate" and then "pip install --pre openvino==2023.1.0.dev20230811" and generation of higher resolution images should work. We successfully tested 768x768 on an A750, where previously even the A770 16GB failed and seemed to run out of VRAM.

Update, 8/18/2023: A second fix is now available, which enabled v2-1 support alongside the above 768x768 support. The details on how to get v2-1 working are detailed at the bottom of this page. In short, download the v2-1 safetensors file, put that in the models folder. Also download the v2-inference.yaml file and put it in the configs folder. You should now be able to select the appropriate model and configuration in Stable Diffusion.

Sign up to read this article
Read news from 100’s of titles, curated specifically for you.
Already a member? Sign in here
Related Stories
Top stories on inkl right now
Our Picks
Fourteen days free
Download the app
One app. One membership.
100+ trusted global sources.