I spent over 35 years as a high tech exec at HPE, IBM, and AMD. Now I love to learn and share the amazing hardware and services being built to enable Artificial Intelligence, the next big thing in technology.
Chasing Nvidia is hard; Team Green simultaneously countered with new software that can triple AI performance. Consequently, AMD ’s performance claims are already out of date. And AMD compared it against the older Hopper, not Blackwell which will ship long before the MI325 .
AMD continues to make excellent progress on the hardware and software front for advancing AI. However, AMD’s performance claims are unrealistic since they again did not use Nvidia’s optimization software. Nonetheless, the MI325 is fast and boasts a huge amount of HBM3E .Before we dive into the AI accelerator, let’s look at the new 5th-gen EPYC CPU or Turin they just announced to see if they are maintaining their lead over Intel Xeon. The fifth-generation EPYC CPU offers up to 192 efficiency cores for web apps and up to 192 cores for performance-demanding apps. However, we note that AMD again chose to forgo on-chip AI acceleration, which Intel added to Xeon for the last two generations. It seems strange, but I suspect this is what the hyperscalers told AMD they preferred. Nonetheless, we expect AMD to continue increasing its market share in data center CPUs.AMD CEO Dr. Lisa Su kicked off the AI portion of the event by updating the company’s projected market size for AI accelerators in the data center to $500B by 2028. At last December’s launch for the MI300, she predicted the market would exceed $400 billion by 2027, so its not clear that this is much of an increase. But while AMD investors sold, driving down AMD’s price by over 4.5%, Nvidia reached a near record high, up over 1%.Northern Lights Forecast: Geomagnetic Storm May Cause Aurora Borealis To Be Visible In These States Tonight The primary difference between the MI300 and MI325 is the amount of HBM3, which we assume are stacked 12 DDR chips high. The 256GB of HBM is 80% more memory than on the Nvidia H100 and 50% more than the H200. This additional memory can matter a lot, as models get larger. Meta said on stage that 100% of their inference processing on Llama 405B is being serviced by the MI300X. I wonder if that will remain the case now that Nvidia announced on the same day that they increased performance of Llama 405B on the H200 by another 50%.How fast is the new MI325X? AMD claimed that the 325 is 40% faster than the soon-to-be-old H200 for certain inference workloads. If history is a guide, we can be certain this benchmark did not use Nvidia’s optimizing software which can easily increase performance by 2-4 times.AMD also touted their ROCm 6.2 software which essentially replaces Nvidia’s CUDA libraries. Surprisingly. AMD did not update ROCm with anything like TensorRT LLM, which can again increase performance dramatically.And AMD teased next years MI350 series, which they claim will have better performance, and will support FP4, which is not supported on MI325 but is supported on Blackwell.If you don’t think a developer would use Nvidia’s optimization software to increase performance by 2-4X , then one can conclude the MI325 is faster than the H200. If you use AMD’s optimizations in ROCm. The larger HBM will attract a lot of users, however, like Meta. We will have to wait to see how much the MI350 can improve the contrast with Blackwell, which AMD ignored in the event. Disclosures: This article expresses the opinions of the author and is not to be taken as advice to purchase from or invest in the companies mentioned. My firm, Cambrian-AI Research, is fortunate to have many semiconductor firms as our clients, including BrainChip, Cadence, Cerebras Systems, D-Matrix, Esperanto, Groq, IBM, Intel, Micron, NVIDIA, Qualcomm, Graphcore, SImA.ai, Synopsys, Tenstorrent, Ventana Microsystems, and scores of investors. I have no investment positions in any of the companies mentioned in this article. For more information, please visit our website atOur community is about connecting people through open and thoughtful conversations. We want our readers to share their views and exchange ideas and facts in a safe space.Insults, profanity, incoherent, obscene or inflammatory language or threats of any kindContinuous attempts to re-post comments that have been previously moderated/rejectedAttempts or tactics that put the site security at riskProtect your community.
AMD Nvidia Epyc CPU AI H200 MI325 MI350
United States Latest News, United States Headlines
Similar News:You can also read news stories similar to this one that we have collected from other news sources.
CARV Unveils CARV Labs, a $50M Accelerator to Fund Decentralized Data EcosystemCrypto Blog
Read more »
Following 3 Unicorns in 2 Years, Web3 Accelerator Beacon Launches Its Largest Cohort to DateCrypto Blog
Read more »
Protocol Village: Copper.co to Custody CoinDesk 20 Members, CARV Launches $50M AcceleratorBradley Keoun is the managing editor of CoinDesk's Tech & Protocols team. He owns less than $1,000 each of several cryptocurrencies.
Read more »
Flagstar Bank Launches Unique Fintech Accelerator ProgramThis article explores Flagstar Bank's innovative fintech accelerator program, highlighting its unique approach that blends elements of CVC, incubation, and acceleration. The program focuses on mentoring and supporting promising fintech entrepreneurs while fostering long-term commercial relationships.
Read more »
AMD launches AI chip to rival Nvidia's BlackwellAdvanced generative AI such as OpenAI’s ChatGPT requires massive data centers full of GPUs, which has created demand for more companies to provide AI…
Read more »
AMD Launches AI Chip To Challenge Nvidia In Data Center MarketAMD unveiled a new artificial intelligence chip, the Instinct MI325X, designed to compete directly with Nvidia's data center graphics processors (GPUs). The launch sets the stage for a battle in the booming AI market, where AMD aims to challenge Nvidia's dominance.
Read more »
