Do I need a 1000W power supply for the RTX 5090?

NVIDIA recommends a 1000W PSU. The card has a 575W board power on its own, and a full system under load can pull 700W or more, so a quality 1000W unit with a native 12V-2x6 connector is the safe choice. Reusing an older or marginal supply with adapters is how people end up with melted connectors.

RTX 5090 Review: Power, or Overkill?

Q: Is the RTX 5090 worth it over the RTX 4090?

For most gamers already on a 4090, no. The RTX 5090 is roughly 30 percent faster at native 4K and around 27 to 35 percent faster in ray tracing, which is real but not transformational for a card that costs more and draws 575 watts. It earns the money if you want native 4K path tracing at high refresh, or if you run local AI and need the 32GB of memory and FP4 support the 4090 cannot offer.

Q: Can the RTX 5090 run 70B parameter language models?

Yes, at 4-bit quantization. A 70B model at Q4 lands around 40GB, which is too large for the 24GB RTX 4090 without offloading, but the RTX 5090's 32GB handles tighter 70B Q4 builds and comfortably runs 32B and 13B models. For 70B at higher precision or several models at once, an Apple M5 Max with up to 128GB of unified memory holds what the 5090 physically cannot, although it generates tokens more slowly.

Q: Is a MacBook with M5 Max better than an RTX 5090 for AI?

It depends on the model size. For models that fit inside 32GB, the RTX 5090 is far faster thanks to roughly 1.79 TB/s of memory bandwidth against the M5 Max's 614 GB/s. For very large models, the M5 Max's 128GB of unified memory lets it run what the 5090 cannot hold at all, at lower speed but in total silence and under about 120 watts for the whole machine.

Q: Can you play modern PC games on an Apple M5 Mac?

Some, but not at the level of a 5090 PC. The native Mac catalog is growing and includes titles like Cyberpunk 2077 and several Resident Evil games, and the M5 GPUs added hardware ray tracing, but most demanding Windows games either do not run natively or run through translation layers with a performance cost. For serious PC gaming the RTX 5090 is in a different league.

// 01 the pitchA relic from the future, and a $2,000 asterisk

Jensen Huang held it up on the CES stage the way he always does, like a relic pulled back from somewhere we have not arrived yet, and told a room of believers it would make everything else feel slow. Then came the line he could not resist, the one about how the more you buy, the more you save, delivered with the practiced smile of a man whose company spent the late nineties thirty days from collapse and now sits among the most valuable on earth. On paper the RTX 5090 lives up to the theatre. It is the fastest thing you can legally bolt into a desktop, and nothing from anyone else is close.

The asterisk arrived in the same breath. NVIDIA's headline claim, that the 5090 is twice the 4090, is true only inside the games and apps that support DLSS 4 and its new Multi Frame Generation, where three of every four frames you see are conjured by AI rather than rendered. Strip that away and the real, raw gap is smaller and far more honest. This review is about that honest number, measured twice. Once for the gamer who just wants frames, and once for the person quietly running language models on the machine under their desk, which is a very 2026 thing to be doing.

The short version: the 5090 is magnificent and slightly absurd. Whether that absurdity is aimed at you depends entirely on which hat you are wearing, and on what you already own.

// 02 the architectureWhat Blackwell actually changed

Generational GPU launches love big numbers, so it helps to know which ones matter. The 5090 is built on NVIDIA's Blackwell architecture, the GB202 die fabricated on TSMC's 4NP process, and it is genuinely large: 21,760 CUDA cores across 170 streaming multiprocessors, against the 4090's 16,384 cores and 128 multiprocessors. That is more of everything, and more is nice, but it is not where the interesting story sits.

The interesting story is memory. The 5090 pairs 32GB of new GDDR7 with a 512-bit bus, and the result is roughly 1.79 terabytes per second of bandwidth, against just over one terabyte per second on the 4090. That is close to an eighty percent jump, and it is the single spec that quietly decides most of this review. High-end gaming at 4K is hungry for bandwidth, and modern AI inference is hungrier still. NVIDIA also widened the L2 cache to 98MB from the 4090's 73MB, gave the card fifth-generation Tensor cores that can run FP4, a 4-bit number format the 4090 cannot touch, and the new fourth-generation RT cores that push ray tracing harder.

Two software-shaped things ride on that silicon. DLSS 4 brings a new transformer-based upscaling model that looks cleaner than what came before, and it brings Multi Frame Generation, a Blackwell-only trick that generates up to three extra frames for every one the GPU truly renders. That is the source of the doubled marketing figure, and we will treat it with the suspicion it deserves later. The honest baseline is the next number down.

RTX 5090 vs RTX 4090: the spec sheet that matters// face-off

Specification	RTX 5090	RTX 4090	The delta
Architecture	Blackwell, GB202	Ada Lovelace, AD102	New generation
CUDA cores	21,760	16,384	+33%
Memory	32GB GDDR7	24GB GDDR6X	+8GB, faster type
Memory bus	512-bit	384-bit	Wider
Bandwidth	~1,792 GB/s	~1,008 GB/s	+78%
L2 cache	98 MB	73 MB	+25 MB
Tensor cores	680, 5th gen, FP4	512, 4th gen	FP4 support
Frame Generation	DLSS 4, Multi (up to 4x)	DLSS 3, single (2x)	Blackwell only
Board power	575 W	450 W	+125 W draw
Launch MSRP	$1,999	$1,599	+$400

Every row but the last two goes to the 5090. The card wins on capability and loses on appetite and price, which is the whole tension of this review in one table. Bandwidth, highlighted, is the spec that drives both the gaming and the AI results below.

// 03 the gamer's verdictHat one: the gamer chasing frames

Put a 5090 in a gaming rig and the first thing you notice is that it refuses to be the bottleneck. At native 4K it averages around thirty percent more frames than a 4090 across a wide spread of games, with the spread itself telling the real tale: some titles gain barely twenty percent, a few stretch past forty or even fifty. Turn on heavy ray tracing and the gap settles around twenty-seven to thirty-five percent. Turn on full path tracing, the most punishing lighting workload in games today, and the card stretches its legs: Cyberpunk 2077 in its Overdrive mode runs roughly thirty-eight percent faster than on a 4090, and Black Myth: Wukong lands near thirty-seven percent.

This is where the 5090 earns its keep as a gaming card. Native 4K path tracing has been a fantasy spec for years, the kind of thing that brought a 5080 to a slideshow around twenty frames a second without help. The 5090 is the first consumer card that makes it genuinely playable, and with the cleaner DLSS 4 upscaler doing the honest work of filling in resolution, it can hold a high refresh in the prettiest games ever shipped. If your dream is Cyberpunk's neon rain or Alan Wake 2's haunted forest at maximum fidelity on a 4K high-refresh panel, nothing else gets you there.

Native 4K performance, RTX 5090 vs RTX 4090// higher is better, no frame generation

Cyberpunk 2077
path tracing

5090 · +38%

4090

Black Myth
Wukong, PT

5090 · +37%

4090

Alan Wake 2
path tracing

5090 · +35%

4090

12-game raster
average

5090 · +30%

4090

Ray tracing
average

5090 · +29%

4090

RTX 5090RTX 4090

Bars show relative frame rates at native 4K, drawn from aggregated launch reviews (KitGuru, GamersNexus, DSOGaming). Per-game gains range from roughly +20% to +50%; path-traced titles sit at the top. These are real rendered frames, before any Multi Frame Generation is switched on.

The Multi Frame Generation asterisk

Now the asterisk. NVIDIA's claim that the 5090 doubles the 4090 leans almost entirely on Multi Frame Generation, and it deserves a clear-eyed look rather than a cheer or a sneer. The technique works: a thirty to forty frame baseline can feel like a hundred plus, motion looks fluid, and in a slow, gorgeous single-player game it is a genuine pleasure. But generated frames are not the same currency as rendered ones. They add a little input latency, they can smear on fast motion, and crucially they do not help if your true frame rate is already low, because the engine still has to produce that one real frame before the AI can invent the next three. It is a smoothing layer, not a substitute for raw power, and reading a "2x" built on it as if it were rendered performance is exactly the kind of marketing multiplier this site refuses to take at face value.

How DLSS 4 Multi Frame Generation fills the gap// 1 rendered frame, 3 invented

For every frame the GPU truly renders (R), DLSS 4 can invent up to three more (AI). The result feels fluid, but the felt responsiveness is still tied to that single rendered frame. Great for fidelity, not a replacement for the underlying horsepower.

So should a 4090 owner upgrade?

For pure gaming, the honest answer is no. A thirty percent average gain is the sort of jump you feel in benchmarks and rarely in your chair, and it shrinks fast as you step down from 4K. At 1440p the lead narrows to around twenty percent because the processor, not the graphics card, starts calling the shots, and at 1080p it nearly evaporates. Paying another two thousand or more, and feeding the thing 575 watts, for a margin you mostly see in a bar chart is a poor trade if you already own the previous champion. The 5090 is for someone building fresh, someone coming from a 3080 or 3090 or an older mid-range card, or someone whose specific obsession is native 4K path tracing at high refresh. If that last sentence is not you, your 4090 is fine and will stay fine for a while.

The uplift shrinks as the resolution drops// 5090 lead over 4090

+30%

1440p

+20%

1080p

~+10%

Below 4K the processor becomes the limit and the world's fastest GPU has nothing to do with its spare capacity. The 5090 is a 4K-and-above instrument. On a 1080p monitor it is money set on fire.

One more thing the gamer should keep in the corner of their eye: the future. The Witcher 4 is being built on Unreal Engine 5 with Lumen, Nanite and ray tracing running at once, and Grand Theft Auto VI arrives on consoles in November 2026 with a PC port expected to follow roughly a year later. When those land, native 4K at maximum settings will get heavy again, and the 5090 is the card best placed to swallow them. That is a reason to buy if you are starting from scratch. It is not, on its own, a reason to throw out a perfectly good 4090 today.

// 04 the AI verdictHat two: the AI tinkerer at the desk

Switch hats. The most interesting thing happening to the 5090 has little to do with games. A growing crowd of developers, researchers and hobbyists run large language models locally, for privacy, for cost, for the simple pleasure of owning the whole stack, and for that crowd the 5090 reads very differently. Here the headline spec is not the core count, it is that eighty percent more memory bandwidth, because token generation is bound by how fast the GPU can stream model weights out of memory, not by how clever the cores are. More bandwidth means more tokens per second, almost in a straight line.

In real, daily use, people report the 5090 running local models roughly twenty-five to thirty-five percent faster than a 4090, with the gap widest on big models and long contexts where the 4090 starts choking on memory traffic. Synthetic tests on smaller models can show even more, well past sixty percent in a few cases, but the lived improvement most people feel is in that quarter-to-a-third range. Real and welcome, not revolutionary.

Local LLM throughput, tokens per second// llama.cpp / Ollama style workloads

8B model
(small)

5090 · ~210 t/s

4090 · ~128 t/s

M5 Max · est.

70B model
Q4, long ctx

5090 · tight fit

M5 Max · ~30-35 t/s

RTX 5090RTX 4090Apple M5 Max (estimated, hatched)

On small models that fit comfortably, the 5090 leads on raw speed. On a 70B model at 4-bit, the 4090's 24GB simply cannot hold it without offloading, so it drops out; the 5090 manages a tight fit, and the M5 Max runs it slower but in total silence. Apple figures are early estimates based on bandwidth and M4 precedent, with independent benchmarks still pending.

The real prize is the 32 gigabytes

Speed is the smaller half of the story. The reason an AI tinkerer wants a 5090 over a 4090 is the jump from 24GB to 32GB, because that extra headroom changes what you can run at all. A 4090 has to quantize aggressively or offload to system memory the moment a model gets serious, and offloading turns a snappy session into a crawl. The 5090's 32GB lets you fit a 70B model at 4-bit, run a 32B model with real context length, or fine-tune a 13B model at full FP16 precision with LoRA adapters without hitting a wall. Add FP4, which the 4090 cannot do at all and which roughly doubles throughput over FP8 once the tooling matures, and the 5090 becomes the cheapest sensible on-ramp to serious local AI. Not because it is the fastest silicon in absolute terms, but because the memory finally stops getting in your way.

What fits: model memory footprint vs available memory// 4-bit unless noted

Bar length is the memory a model needs; the vertical lines are what each device offers. Everything left of a line fits on that device. The 5090's 32GB clears the practical sweet spot up to a tight 70B at 4-bit; the M5 Max's 128GB reaches models neither NVIDIA consumer card can hold, just more slowly.

// 05 the comparisonVersus Apple's M5 Macs: two philosophies, one question

This is the comparison everyone actually wants, and it is the most misunderstood, because a $2,000 graphics card and a complete Apple computer are not the same kind of object. The 5090 is a component you feed with a 1000-watt power supply inside a tower you build. An M5, M5 Pro, M5 Max or Mac mini is a finished, silent machine. Comparing them only makes sense once you say what for. So we will say it: gaming, and AI. Two questions, two very different answers.

Apple's M5 generation is a real leap. The base M5, which arrived in late 2025, put a Neural Accelerator inside every GPU core and pushed memory bandwidth to around 153 GB/s. In March 2026 the M5 Pro and M5 Max followed, built on a new Fusion Architecture that fuses two dies into one chip, with the M5 Max scaling to a 40-core GPU, up to 128GB of unified memory and 614 GB/s of bandwidth. Apple claims more than four times the GPU's AI compute over the previous generation. The Mac mini, for now, is still on the older M4, with an M5 version widely expected to arrive around mid-2026. So when we say "Mac mini" today we mean the current, very affordable M4 box, with the caveat that its successor is close.

For gaming, it is not a contest

Let us be blunt, because the brand demands honesty over diplomacy. For serious PC gaming, the 5090 obliterates every Mac on this page, and not mainly on raw power. The deeper problem is the library. Most demanding Windows games either do not run natively on macOS or run through translation layers that cost you performance. Apple has done real work here, hardware ray tracing arrived with this GPU family and the native catalog now includes Cyberpunk 2077 and several Resident Evil titles, but the gap to a 5090 driving native 4K path tracing is not a gap you close with a driver update. If gaming is the job, you buy the PC. Full stop.

For AI, the question finally gets interesting

Here the answer flips on a single hinge: does your model fit in 32GB? If it does, the 5090 wins, and it wins comfortably, because roughly 1.79 TB/s of bandwidth against the M5 Max's 614 GB/s is close to a threefold advantage on the thing that sets token speed. For a 7B, 13B or 32B model, NVIDIA stays two to three times faster, full stop. But if your model does not fit in 32GB, the hinge swings the other way. The M5 Max's 128GB of unified memory can simply hold a 70B model at higher precision, or a 120B model at 4-bit, or several models at once, things the 5090 physically cannot load without spilling to system memory and slowing to a walk. The Mac runs them more slowly, an estimated thirty to thirty-five tokens a second on a 70B at 4-bit, but it runs them, silently, on a laptop you can close and carry.

RTX 5090 vs RTX 4090 vs the Apple M5 family// who wins for what, per row

What you care about	RTX 5090	RTX 4090	M5 Max	M5 Pro	M5 / Mac mini
What it is	$2k GPU	$2k GPU	Pro laptop SoC	Pro laptop SoC	Mainstream SoC
Memory for AI	32GB	24GB	up to 128GB	up to 64GB	up to 32GB
Memory bandwidth	~1,792 GB/s	~1,008 GB/s	614 GB/s	307 GB/s	~153 GB/s
Small models (≤32GB)	Fastest	Fast	Slower	Slower	Slowest
Huge models (70B+)	Tight / offloads	Cannot fit	Holds it	Partial	No
PC gaming	Untouchable	Excellent	Limited	Limited	Light
Power, whole system	700W+	600W+	~120W	~80W	~30-65W
Portability	Tower only	Tower only	Laptop	Laptop	Laptop / tiny box
Roughly costs	$2,000+ card	$1,800+ card	$3,500+ machine	$2,500+ machine	$599-1,099 machine

No single column wins everything, which is the point. The 5090 owns speed and gaming; the M5 Max owns capacity, efficiency and portability; the cheaper M5 and Mac mini win on price for lighter work. Pick the row that describes your life, then read across.

Why unified memory is the whole trick

The reason a laptop can ever beat a 575-watt monster at anything is architectural, and it is worth understanding because it shapes the next decade. A PC keeps two separate pools of memory, the system RAM the processor uses and the dedicated VRAM on the graphics card, and data has to be copied across the PCIe bus between them. Apple's chips use one unified pool that the processor, GPU and Neural Engine all read from directly, with no copying. For AI, where the model weights are enormous and constantly in motion, that means a Mac's entire memory is addressable by the GPU. A 5090 has fast memory, but only 32GB of it. An M5 Max has slower memory, but up to 128GB of it, all available to the work. Different bets on the same problem.

Discrete VRAM vs unified memory// why a Mac can hold what a 5090 cannot

The 5090 has the faster memory but a hard 32GB ceiling, with a PCIe copy between the processor and the card. Apple shares one large pool across everything, so the whole 128GB is available to the model. Speed versus capacity, drawn out.

// 06 the roomPower, heat, and the bill you forgot about

The spec sheet says 575 watts, and that number has consequences the marketing slides skip. NVIDIA recommends a 1000-watt power supply, and a full system under load can pull seven hundred watts or more, which is a space heater you happen to play games on. The redesigned Founders Edition is a real engineering win here, a genuinely compact two-slot card with dual flow-through fans and a vapor chamber that keeps the core impressively cool, though memory temperatures run warmer and the fans are audible when you push it.

The connector deserves a plain warning. This generation continues with the 12V high-power plug, and the previous round of cards taught everyone an expensive lesson about what happens when it is seated badly or fed through tired adapters: melted connectors and dead hardware. Use a quality supply with a native cable, push it in until it clicks, and do not improvise. Set against an M5 Max laptop that does its AI work under roughly 120 watts for the entire machine, in silence, the 5090's appetite is the clearest expression of its character. It is not subtle, and it was never trying to be.

Watts to do the work, whole system under load// lower is calmer

RTX 5090 PC

~700-850W

RTX 4090 PC

~600-700W

M5 Max laptop

~120W

Mac mini (M4)

~30-65W

The performance crown comes with a power bill and a warmer room. For always-on AI work that runs for hours, that difference compounds into real money and real heat, which is a quiet point in Apple's favour and an honest mark against the 5090.

// 07 the verdictSo, power or overkill?

Both, and which one depends on you. The RTX 5090 is the finest consumer graphics card ever made and an exercise in excess at the same time, and pretending it has to be only one of those is how reviews end up dishonest. It is the only card that turns native 4K path tracing from a dream into a playable reality, and it is the cheapest sensible door into serious local AI thanks to that 32GB and FP4. It is also $2,000 or more, draws 575 watts, and offers a 4090 owner a margin they will mostly meet in graphs. So we will not give it one score. We will give it two.

// hat one · the gamer

8.5/10

The best, by a clear margin, for native 4K and path tracing at high refresh. But the win shrinks below 4K, and if you own a 4090 the upgrade is hard to justify today. Buy it building fresh, or chasing the prettiest games at the highest fidelity. Otherwise, wait.

// hat two · the AI tinkerer

9/10

The cheapest serious on-ramp to local AI, and the 32GB plus FP4 is the reason, not the raw speed. Sublime for anything that fits in memory. If you must run truly enormous models, an M5 Max with 128GB is a different and entirely valid answer. Know which you are.

Overkill for the person on a 1080p monitor who upgraded last year. A genuine, slightly ridiculous joy for the one running path-traced cities at 4K, or fine-tuning a model on a Tuesday night for the fun of it. The 5090 was built for the second person. If that is you, you already knew it before you reached this paragraph. If it is not, your wallet just dodged a 575-watt bullet, and you are welcome.

Where does all this silicon ambition lead next? Straight into the data centre, where the same Blackwell lineage scales to racks that make a 5090 look like a stocking filler. We took that walk in our field guide to NVIDIA's Vera Rubin NVL72, the platform built for the AI buildout these consumer cards only hint at. And if you want to know where the obsession with frames and fidelity began, it runs back through the 3dfx Voodoo era and the home computers that lit the fuse, a story we tell in our look back at the Commodore Amiga 500.

// 08 questionsFrequently asked

Is the RTX 5090 worth it over the RTX 4090?

For most gamers already on a 4090, no. It is about thirty percent faster at native 4K and twenty-seven to thirty-five percent faster in ray tracing, real but not transformational for a card that costs more and draws 575 watts. It earns the money if you want native 4K path tracing at high refresh, or if you run local AI and need the 32GB and FP4 the 4090 cannot offer.

Can the RTX 5090 run 70B language models?

Yes, at 4-bit. A 70B model at Q4 lands near 40GB, too large for the 24GB 4090 without offloading, but the 5090's 32GB handles tighter 70B Q4 builds and comfortably runs 32B and 13B models. For 70B at higher precision or several models at once, an M5 Max with up to 128GB of unified memory holds what the 5090 cannot, though it generates tokens more slowly.

Do I need a 1000W power supply?

NVIDIA recommends one, and it is the safe call. The card alone is 575 watts and a full system under load can pass 700, so a quality 1000W unit with a native 12V-2x6 cable is the right choice. Reusing a marginal supply with adapters is how people end up with melted connectors.

Is a MacBook with M5 Max better for AI than a 5090?

It depends on model size. For anything that fits in 32GB, the 5090 is far faster thanks to roughly 1.79 TB/s of bandwidth against the M5 Max's 614 GB/s. For very large models, the M5 Max's 128GB of unified memory runs what the 5090 cannot hold at all, slower but silent and under about 120 watts for the whole machine.

Can you game on an Apple M5 Mac?

Some, but not at a 5090 PC's level. The native Mac catalog is growing and the M5 GPUs added hardware ray tracing, but most demanding Windows games either do not run natively or run through translation layers with a performance cost. For serious PC gaming the 5090 is in a different league.

RTX 5090 Review: Power, or Overkill?

// 01 the pitchA relic from the future, and a $2,000 asterisk

// 02 the architectureWhat Blackwell actually changed

// 03 the gamer's verdictHat one: the gamer chasing frames

The Multi Frame Generation asterisk

So should a 4090 owner upgrade?

// 04 the AI verdictHat two: the AI tinkerer at the desk

The real prize is the 32 gigabytes

// 05 the comparisonVersus Apple's M5 Macs: two philosophies, one question

For gaming, it is not a contest

For AI, the question finally gets interesting

Why unified memory is the whole trick

// 06 the roomPower, heat, and the bill you forgot about

// 07 the verdictSo, power or overkill?

// 08 questionsFrequently asked

// read nextMore from Silicon Tales

RTX 5090 Review:
Power, or Overkill?