Jonathan Ross (@JonathanRoss321) on X

Double the World's AI Compute Chief Software Architect @ Nvidia, Founder of Groq, Creator of the LPU & Google's TPU

What can you do with Llama quality and Groq speed? You can do Instant. That's what. Try Llama 3.1 8B for instant intelligence on groq.com.
We built the region’s largest inference cluster in Saudi Arabia in 51 days and we just announced a $1.5B agreement for Groq to expand our advanced LPU-based AI inference infrastructure. Build fast.
Anyone asking if AI is a bubble hasn't tried it
What are we doing with this capital? Originally we intended to raise $300M which was going to allow us to deploy 108,000 LPUs into production by end of Q1 2025. We raised 2x that, so we're also expanding our cloud and core engineering teams. We're hiring!

Cisco and Samsung are backing AI chip startup Groq at a $2.8 billion valuation trib.al/SpZCp30
Nvidia hit 100k developers in 7 years. Our goal was to hit 100k developers in 7 weeks. It's been 6 weeks, and...
What can you do with Llama quality and Groq speed? Instant. That's what. 3 months back: Llama 8B running at 750 tokens/sec. Now: Llama 70B model running at 3,200 tokens/sec. We're still going to get a liiiiiiitle bit faster, but this is our V1 14nm LPU - how fast will V2 be? 😉
Groq just set a new speed record. And we still plan to get a liiiiiiitle bit faster 😉

More speed on our existing 14nm silicon. GA soon. Reach out if you want to go fast!
Prediction: AI will displace social drinking within 5 years. Just as alcohol is a social disinhibitor, like the Steve Martin movie Roxanne, people will use AI-powered earbuds to help them socialize. At first we'll view it as creepy, but it will quickly become superior to alcohol.
(1/5) Everyone at Groq has one of these challenge coins on them. It’s how we create alignment. One side says it's 25 million, because we're going to get to 25 million tokens per second by the end of the year. On the other side, it says, “Make it real. Make it now. Make it wow.”
When you make compute cheaper do people buy more? Yes. It's called Jevons Paradox and it's a big part of our business thesis. In the 1860s, an Englishman wrote a treatise on coal where he noted that every time steam engines got more efficient people bought more coal. 🧵(1/3)
What do @GroqInc's LPUs cost? So much curiosity! We're very comfortable with this pricing and performance - and no, the chips/cards don't cost anywhere near $20,000 😂 #Groqspeed
350,000,000 downloads of an LLM is nuts! How long did it take Linux to get to that number?

Jonathan Ross, Founder & CEO, Groq: “Open-source wins. Meta is building the foundation of an open ecosystem that rivals the top closed models and at Groq we put them directly into the hands of the developers—a shared value that’s been fundamental at Groq since our beginning. To