Intel GPU prices have stayed fine, but I do wonder if they are viable for Inference if they will wind up like Nvidia GPUs, severely overpriced.
cmovq [3 hidden]5 mins ago
I mean it kind of is considering that's comparable to a 5070 which has 672 GB/s? Benefit of NVIDIA being the only one using GDDR7 for now I guess.
daemonologist [3 hidden]5 mins ago
7800 XT has 624 GB/s as well, and can be found for $400 used. 16 GB of course.
jmward01 [3 hidden]5 mins ago
I think this shows a shift in model architecture. MOE and similar need more memory for the compute available than just one big model with a lot of layers and weights. I think this is likely a trend that will accelerate. You build the trade-off in which encourages even more experts which means more of a tradeoff, so more experts.....
zozbot234 [3 hidden]5 mins ago
Most people doing local inference run the MoE layers on CPU anyway, because decode is not compute constrained and wasting the high-bandwidth VRAM on unused weights is silly. It's better to use it for longer context. Recent architectures even offload the MoE experts to fast (PCIe x4 5.0 or similar performance) NVMe: it's slow but it opens up running even SOTA local MoE models on ordinary hardware.
tbyehl [3 hidden]5 mins ago
Where's the A310 / A40 successor? Gimme some SR-IOV in a slot-powered, single-width, low-profile card.
SkyeCA [3 hidden]5 mins ago
32GB of vram for a decent price? I wonder if these will work well for VR, because vram is my current main issue.
aruametello [3 hidden]5 mins ago
(VR enthusiast here, mostly under windows)
intel support has been mild to non existent in the VR space unfortunately. Given the very finicky latency + engine support i wouldn’t bet on a great experience, but hope for the best for more competition in this market. (even amd has a lot of caveats comparing to nvidia)
Footnotes:
* critical "as low as it can be" low latency support on intel XE is still not as mature as nvidia, amd was lagging behind until recently.
* Not sure about "multiprojection" rendering support on intel, lack of support can kill vr performance or make it incompatible. (the optimized vr games often rely on it)
pjmlp [3 hidden]5 mins ago
New cards in 2026, and targeting Vulkan 1.3?!
mikelitoris [3 hidden]5 mins ago
Too little too late, classic Intel
nickthegreek [3 hidden]5 mins ago
Both have 32gb vram. Could be a pretty compelling choice.
cptskippy [3 hidden]5 mins ago
They certainly look viable as replacements for my Tesla P40 for virtual workloads.
whalesalad [3 hidden]5 mins ago
Anyone running an ARC card for desktop Linux who can comment on the experience? I've had smooth sailing with AMD GPU's but have never tried Intel.
himata4113 [3 hidden]5 mins ago
Linus Torvalds runs ARC :)
oakpond [3 hidden]5 mins ago
Running dual Pro B60 on Debian stable mostly for AI coding.
I was initially confused what packages were needed (backports kernel + ubuntu kobuk team ppa worksforme). After getting that right I'm now running vllm mostly without issues (though I don't run it 24/7).
At first had major issues with model quality but the vllm xpu guys fixed it fast.
Software capability not as good as nvidia yet (i.e. no fp8 kv cache support last I checked) but with this price difference I don't care. I can basically run a small fp8 local model with almost 100k token context and that's what I wanted.
robertVance [3 hidden]5 mins ago
Ive ran arc on fedora for years and for general desktop use it’s been perfect. For llm’s/coding it’s getting better but it’s rough around the edges. Had a bug where trying to get vram usage through pytorch would crash the system, ect.
Levitating [3 hidden]5 mins ago
Afaik driver support is very complete on Linux. You often see Arc GPUs used in media transcoding workloads for that reason.
bpye [3 hidden]5 mins ago
My B580 works fine on Linux. Graphics perf is a bit worse than under Windows, but supposedly compute is pretty much the same.
wyre [3 hidden]5 mins ago
There was the video a little while back where LTT built a computer for Linus Torvalds and they put an Intel Arc card inside, so I'd imagine Linux support is at the very least, acceptable.
I'm running A-series Arc for media transcoding and it works just fine.
WarmWash [3 hidden]5 mins ago
Wake me when they wake up and release a middling card with 128GB memory.
zozbot234 [3 hidden]5 mins ago
Buy Strix Halo or Apple Silicon platforms and you get essentially that.
Weryj [3 hidden]5 mins ago
Buy 4?
electronsoup [3 hidden]5 mins ago
Which mainboards are cheap and have 4 pcie16x (electrical) slots, that don't need weird risers to fit 4 GPUs
irishcoffee [3 hidden]5 mins ago
If your actual gripe is risers, sounds like a "you" problem, not a technical problem.
WarmWash [3 hidden]5 mins ago
Because I don't want to spend $4k.
I want to spend $1500 for a card that can run a proper large model, even if it only can do 25 tk/s.
Intel is squandering a golden opportunity to knee-cap AMD and Nvdia, under the totally delusional pretense that intel enterprise cards still have a fighting chance.
ericd [3 hidden]5 mins ago
I saw a good quote recently, "you're not going to get 128 gigs of vram loose in a plastic bag for that much".
DiabloD3 [3 hidden]5 mins ago
Since they fired the entire Arc team and a lot of the senior engineers already updated their Linkedins to reflect their new positions at AMD, Nvidia, and others, as well as laying off most of their Linux driver team (GPU and non-GPU), uh...
WTF?
staticman2 [3 hidden]5 mins ago
You are exaggerating, right? They didn't really fire the entire Arc team did they? I couldn't find a source saying that.
DiabloD3 [3 hidden]5 mins ago
Nope, no exaggeration.
The news that Celestial is basically canceled already hit the HN front page, as well as Druid has been canceled before tapeout.
Celestial will only be issued in the variant that comes in budget/industrial embedded Intel platforms that have a combined IO+GPU tile, but the performance big boy desktop/laptop parts that have a dedicated graphics tile will ship an Nvidia-produced tile.
There will be no Celestial DGPU variant, nor dedicated tile variant. Drivers will be ceasing support for DGPUs of all flavors, and no new bug fixes will happen for B series GPUs (as there is no B series IGPUs; A series IGPUs will remain unaffected).
They signed the deal like 2-3 months ago to cancel GPUs in favor of Nvidia. The other end of this deal is the Nvidia SBCs in the future will be shipping as big-boy variants with Xeon CPUs, Rubin (replacing Blackwell) for the GPU, Vera (replacing Grace) for the on-SBC GPU babysitter, and newest gen Xeons to do the non-inference tasks that Grace can't handle.
There is also talk that this deal may lead to Nvidia moving to Intel Foundry, away from TSMC. There is also talk that Nvidia may just buy Intel entirely.
For further information, see Moore's Law Is Dead's coverage off and on over the past year.
chao- [3 hidden]5 mins ago
You may be a bit too credulous. There has been a "leak" or "rumor" that Intel's GPU initiatives are canceled about once every three months, for over two years. Yet Intel continues to release new SKUs and make new product announcements. Just last month they announced a new data center GPU product (an inference-focused variant of Jaguar Shores).
I can't see the future, but I can see patterns: the media that reports straight from the industry rumor mill LOVES this "Intel has cancelled its GPUs" story, for whatever reason. I have no particular love for Intel (out of my six current systems, my only Intel box is a cheap NUC from 2018), but at this point, these rumors echo the old joke about economists who "accurately predicted the last nine out of two recessions".
gk-- [3 hidden]5 mins ago
ah, so this is MLID. yeah i'll wait for the announcement.
PowerElectronix [3 hidden]5 mins ago
MLID is a terrible information source.
wtallis [3 hidden]5 mins ago
This is a chip they've had lying around for a while. It's the same architecture as used in the Arc B580 that launched at the end of 2024; this is just a slightly larger sibling. Intel clearly knew that their larger part wouldn't make for a competitive gaming GPU (hence the lack of a consumer counterpart to these cards), but must have decided that a relatively cheap workstation card with 32GB might be able to make some money.
DiabloD3 [3 hidden]5 mins ago
Still seems crooked to sell a GPU that is already lost their driver team and will get no new meaningful updates.
wtallis [3 hidden]5 mins ago
Does it need a huge driver team pushing out big updates in order to be suitable for the kind of Pro use cases it's targeted at? They're explicitly not going after the gaming market so they don't need to be on the treadmill of constant driver updates delivering workarounds and optimizations for the latest game releases.
They're still going to be employing some developers for driver maintenance for the sake of their iGPUs, and that might be enough for these cards.
unethical_ban [3 hidden]5 mins ago
I didn't know this. Have they officially given up on building discrete GPUs? Is this a last gasp of Arc to offload decent remaining architectures at a lower price than nvidia?
It is crazy to me that a world newly craving GPU architecture for AI, and gamers being largely neglected, that Intel would abandon an established product line.
StilesCrisis [3 hidden]5 mins ago
It does sound like a very Intel choice though.
mschuster91 [3 hidden]5 mins ago
> It is crazy to me that a world newly craving GPU architecture for AI, and gamers being largely neglected, that Intel would abandon an established product line.
You still need to fab it somewhere. Intel's fabs have been plagued with issues for years, the AI grifters have bought up a lot of TSMCs allotments and what remains got bought up by Apple for their iOS and macOS lineups, and Samsung's fabs are busy doing Samsung SoCs.
And that unfortunately may explain why Intel yanked everything. What use is a product line that can't be sold because you can't get it produced?
Yet another item on my long list of "why I want to see the AI grift industry burn and the major participants rotting in a prison cell".
vessenes [3 hidden]5 mins ago
Not sure why you'd want this over an apple setup. M4 max is 545GB/s of memory bandwidth - $2k for an entire Mac Studio with 48GB of RAM vs 32 for the B70.
protimewaster [3 hidden]5 mins ago
My thinking is that I'd pick this, because I can't just plug a Mac into a slot in my server and have it easily integrate with all my other hardware across an ultra fast bus.
If they made an M4 on a card that supported all the same standards and was price competitive, though, that might be a good option.
hedgehog [3 hidden]5 mins ago
Being able to keep infrastructure on Linux is a big advantage.
RestartKernel [3 hidden]5 mins ago
How many compatibility issues is MacOS realistically expected to spur? Windows DX felt unusable to me without a Linux VM (and later WSL), but on MacOS most tooling just kinda seems to work the same.
einr [3 hidden]5 mins ago
It’s not the tooling for me, macOS is just bad as a server OS for many reasons. Weird collisions with desktop security features, aggressive power saving that you have to fight against, root not being allowed to do root stuff, no sane package management, no OOB management, ultra slow OS updates, and generally but most importantly: the UNIX underbelly of macOS has clearly not been a priority for a long time and is rotting with weird inconsistent and undocumented behaviour all over the place.
wolfhumble [3 hidden]5 mins ago
> Weird collisions with desktop security features
Linux is not immune to BIOS/UEFI firmware attacks either. Secure Boot, TPM, and LUKS can work well together, but you still depend on proprietary firmware that you do not fully control. LogoFAIL is a good example of that risk, especially in an evil maid scenario involving temporary physical access. I think Apple has tighter control over this layer.
hedgehog [3 hidden]5 mins ago
Provisioning, remote management, containers, virtualization, networking, graphics (and compute), storage, all very different on Mac. The real question is what you would expect to be the same.
bigyabai [3 hidden]5 mins ago
For server usage? macOS is the least-supported OS in terms of filesystems, hardware and software. It uses multiple gigabytes of memory to load unnecessary user runtime dependencies, wastes hard drive space on statically-linked binaries, and regularly breaks package management on system upgrades.
At a certain point, even WSL becomes a more viable deployment platform.
fvv [3 hidden]5 mins ago
with those $2k you can have 2xB70, with 1.2Tb/sec and 64G Vram, on linux ( and you can scale further while mac prices increase are not linear 0
Reubend [3 hidden]5 mins ago
You're absolutely right. And these Intel GPUs will also be much faster in terms of actual math than the M series GPUs that the Apple setup would have.
cptskippy [3 hidden]5 mins ago
Support for Single Root IO Virtualization (SR-IOV) to enable compute and Graphics workloads in virtualized environments.
2OEH8eoCRo0 [3 hidden]5 mins ago
Funny, I not sure why anyone would use Apple over Linux.
wyre [3 hidden]5 mins ago
one can upgrade and swap parts with a computer running an Intel GPU. Linux is very well supported compared to Mac hardware.
~$1000 for the Pro B70, if Microcenter is to be believed:
https://www.microcenter.com/product/709007/intel-arc-pro-b70...
https://www.microcenter.com/product/708790/asrock-intel-arc-...
https://www.bhphotovideo.com/c/product/1959142-REG/intel_33p...
When 32GB NVIDIA cards seem to start at around $4000 that's a big enough gap to be motivating for a bunch of applications.
intel support has been mild to non existent in the VR space unfortunately. Given the very finicky latency + engine support i wouldn’t bet on a great experience, but hope for the best for more competition in this market. (even amd has a lot of caveats comparing to nvidia)
Footnotes:
* critical "as low as it can be" low latency support on intel XE is still not as mature as nvidia, amd was lagging behind until recently.
* Not sure about "multiprojection" rendering support on intel, lack of support can kill vr performance or make it incompatible. (the optimized vr games often rely on it)
I was initially confused what packages were needed (backports kernel + ubuntu kobuk team ppa worksforme). After getting that right I'm now running vllm mostly without issues (though I don't run it 24/7).
At first had major issues with model quality but the vllm xpu guys fixed it fast.
Software capability not as good as nvidia yet (i.e. no fp8 kv cache support last I checked) but with this price difference I don't care. I can basically run a small fp8 local model with almost 100k token context and that's what I wanted.
[1] https://www.youtube.com/watch?v=mfv0V1SxbNA
I want to spend $1500 for a card that can run a proper large model, even if it only can do 25 tk/s.
Intel is squandering a golden opportunity to knee-cap AMD and Nvdia, under the totally delusional pretense that intel enterprise cards still have a fighting chance.
WTF?
The news that Celestial is basically canceled already hit the HN front page, as well as Druid has been canceled before tapeout.
Celestial will only be issued in the variant that comes in budget/industrial embedded Intel platforms that have a combined IO+GPU tile, but the performance big boy desktop/laptop parts that have a dedicated graphics tile will ship an Nvidia-produced tile.
There will be no Celestial DGPU variant, nor dedicated tile variant. Drivers will be ceasing support for DGPUs of all flavors, and no new bug fixes will happen for B series GPUs (as there is no B series IGPUs; A series IGPUs will remain unaffected).
They signed the deal like 2-3 months ago to cancel GPUs in favor of Nvidia. The other end of this deal is the Nvidia SBCs in the future will be shipping as big-boy variants with Xeon CPUs, Rubin (replacing Blackwell) for the GPU, Vera (replacing Grace) for the on-SBC GPU babysitter, and newest gen Xeons to do the non-inference tasks that Grace can't handle.
There is also talk that this deal may lead to Nvidia moving to Intel Foundry, away from TSMC. There is also talk that Nvidia may just buy Intel entirely.
For further information, see Moore's Law Is Dead's coverage off and on over the past year.
I can't see the future, but I can see patterns: the media that reports straight from the industry rumor mill LOVES this "Intel has cancelled its GPUs" story, for whatever reason. I have no particular love for Intel (out of my six current systems, my only Intel box is a cheap NUC from 2018), but at this point, these rumors echo the old joke about economists who "accurately predicted the last nine out of two recessions".
They're still going to be employing some developers for driver maintenance for the sake of their iGPUs, and that might be enough for these cards.
It is crazy to me that a world newly craving GPU architecture for AI, and gamers being largely neglected, that Intel would abandon an established product line.
You still need to fab it somewhere. Intel's fabs have been plagued with issues for years, the AI grifters have bought up a lot of TSMCs allotments and what remains got bought up by Apple for their iOS and macOS lineups, and Samsung's fabs are busy doing Samsung SoCs.
And that unfortunately may explain why Intel yanked everything. What use is a product line that can't be sold because you can't get it produced?
Yet another item on my long list of "why I want to see the AI grift industry burn and the major participants rotting in a prison cell".
If they made an M4 on a card that supported all the same standards and was price competitive, though, that might be a good option.
Linux is not immune to BIOS/UEFI firmware attacks either. Secure Boot, TPM, and LUKS can work well together, but you still depend on proprietary firmware that you do not fully control. LogoFAIL is a good example of that risk, especially in an evil maid scenario involving temporary physical access. I think Apple has tighter control over this layer.
At a certain point, even WSL becomes a more viable deployment platform.