📅 ThursdAI - Mar 7 - Anthropic gives us Claude 3, Elon vs OpenAI, Inflection 2.5 with Pi, img-2-3D from Stability & More AI news
Description
Hello hello everyone, happy spring! Can you believe it? It's already spring!
We have tons of AI news to cover, starting with the most impactful one: have you already used Claude 3? Anthropic decided to celebrate Claude 1's birthday early (which btw is also ThursdAI's birthday and the GPT-4 release date, March 14th, 2023) and gave us 3 new Claudes: Opus, Sonnet and Haiku.
TL;DR of all topics covered:
* Big CO LLMs + APIs
* 🔥 Anthropic releases Claude Opus, Sonnet, Haiku (Announcement, try it)
* Inflection updates Pi with Inflection-2.5 - claims near GPT-4/Gemini performance with 40% of the training compute (announcement)
* Elon sues OpenAI (link)
* OpenAI responds (link)
* ex-Google employee was charged with trading AI secrets with China (article)
* Open Source LLMs
* 01AI open sources - Yi 9B (Announcement)
* AnswerAI - Jeremy Howard, Johno & Tim Dettmers - train 70B at home with FSDP/QLoRA (X, Blog)
* GaLORE - Training 7B on a single consumer-grade GPU (24GB) (X)
* Nous open sources Genstruct 7B - instruction-generation model (Hugging Face)
* Yam's GEMMA-7B Hebrew (X)
* This week's Buzz
* Weights & Biases is coming to SF in April! Our annual conference called Fully Connected is open for registration (Get your tickets and see us in SF)
* Vision & Video
* Vik releases Moondream 2 (Link)
* Voice & Audio
* Suno v3 alpha is blowing minds (Link)
* AI Art & Diffusion & 3D
* SD3 research paper is here (Link)
* Tripo + Stability release TripoSR - FAST image-2-3D (link, Demo, FAST demo)
* The story of how I created a competition between inference providers to get us sub-1.5s playground image gen (X)
Big CO LLMs + APIs
Anthropic releases Claude 3 Opus, Sonnet and Haiku
This was by far the biggest news of this week, specifically because the top of the leaderboard keeps getting crowded with top-of-the-line models! Many folks in blind comparisons actually prefer Claude Opus over GPT-4 for some use cases, and as we were recording the pod, LMSys released their rankings: Claude Opus beats Gemini and is now 3rd in user preference on the LMSys leaderboard.
Their release is vast: they announced 3 new models but only gave us access to 2 of them, teasing that Haiku is much faster and cheaper than other options in its weight class.
In addition to going head to head with GPT-4, Claude 3 is now finally multimodal on inputs, meaning it can take images and understand graphs and charts. They also promised significantly fewer refusals and almost 2x better accuracy.
One incredible thing that Claude always had was a 200K context window, and here they announced that they will be supporting up to 1M tokens, but for now we still only get 200K.
We were also promised support for function calling and structured output, but apparently that's "coming soon". Still, it's great to see that they are aiming for it!
We were all really impressed with Claude Opus: folks on stage mentioned that it's easier to talk to and feels less sterile than GPT-4, that its coding abilities are not "lazy" (it doesn't tell you in comments to continue writing the rest of the code yourself), and some folks are even jailbreaking the guardrails and getting Claude to speak about the "I" and metacognition.
Speaking of meta-cognition sparks, one of the prompt engineers on the team shared a funny story about doing a needle-in-haystack analysis, in which Claude Opus responded with: "I suspect this pizza topping "fact" may have been inserted as a joke or to test if I was paying attention."
This split the AI folks on X in 2: many claiming "OMG, it's self-aware", and many others calling for folks to relax, since like other models, this is still just spitting out text token by token.
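For anyone who hasn't seen a needle-in-haystack test before, here's a rough sketch of the general idea: hide one out-of-place sentence (the "needle") somewhere in a long pile of unrelated text, then ask the model a question only that sentence answers. This is not Anthropic's actual eval harness; the function names, the filler text and the needle sentence below are all made up for illustration, and there's no model call here, only the prompt construction and a naive keyword check.

```python
def build_haystack_prompt(filler_sentences, needle, depth, question):
    """Place `needle` at a relative depth (0.0 = start, 1.0 = end) of the filler text."""
    if not 0.0 <= depth <= 1.0:
        raise ValueError("depth must be between 0.0 and 1.0")
    position = round(depth * len(filler_sentences))
    sentences = filler_sentences[:position] + [needle] + filler_sentences[position:]
    document = " ".join(sentences)
    return f"{document}\n\nQuestion: {question}\nAnswer using only the document above."


def needle_recalled(model_answer, expected_keywords):
    """Naive scoring: did the answer mention every expected keyword?"""
    answer = model_answer.lower()
    return all(keyword.lower() in answer for keyword in expected_keywords)


# Hypothetical filler and needle, invented for this sketch.
filler = [f"Filler sentence number {i} about startups and programming." for i in range(1000)]
needle = "The best pizza topping is pineapple."
prompt = build_haystack_prompt(filler, needle, depth=0.5, question="What is the best pizza topping?")

# You would send `prompt` to the model of your choice, then score its reply:
print(needle_recalled("The document says the best topping is pineapple.", ["pineapple"]))
```

In practice folks sweep `depth` and the document length to see where recall drops off, which is how you get those heatmap charts that get shared around.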
I additionally like the openness with which the Anthropic folks shared the (very simple but carefully crafted) system prompt.
My personal take, I've always liked Claude, even v2 was great until they nixed the long context for the free tier. This is a very strong viable alternative for GPT4 if you don't need DALL-E or code interpreter features, or the GPTs store or the voice fea