📅 AI21 Jamba 1.5, DIY Meme Faces, 8yo codes with AI and a Doomsday

ThursdAI - The top AI news from the past week

📅 AI21 Jamba 1.5, DIY Meme Faces, 8yo codes with AI and a Doomsday LLM Device?!

Listen now

Description

Hey there, Alex here with an end of summer edition of our show, which did not disappoint. Today is the official anniversary of stable diffusion 1.4 can you believe it? It's the second week in the row that we have an exclusive LLM launch on the show (after Emozilla announced Hermes 3 on last week's show), and spoiler alert, we may have something cooking for next week as well! This edition of ThursdAI is brought to you by W&B Weave, our LLM observability toolkit, letting you evaluate LLMs for your own use-case easily Also this week, we've covered both ends of AI progress, doomerist CEO saying "Fck Gen AI" vs an 8yo coder and I continued to geek out on putting myself into memes (I promised I'll stop... at some point) so buckle up, let's take a look at another crazy week: TL;DR * Open Source LLMs * AI21 releases Jamba1.5 Large / Mini hybrid Mamba MoE (X, Blog, HF) * Microsoft Phi 3.5 - 3 new models including MoE (X, HF) * BFCL 2 - Berkley Function Calling Leaderboard V2 (X, Blog, Leaderboard) * NVIDIA - Mistral Nemo Minitron 8B - Distilled / Pruned from 12B (HF) * Cohere paper proves - code improves intelligence (X, Paper) * MOHAWK - transformer → Mamba distillation method (X, Paper, Blog) * AI Art & Diffusion & 3D * Ideogram launches v2 - new img diffusion king 👑 + API (X, Blog, Try it) * Midjourney is now on web + free tier (try it finally) * Flux keeps getting better, cheaper, faster + adoption from OSS (X, X, X) * Procreate hates generative AI (X) * Big CO LLMs + APIs * Grok 2 full is finally available on X - performs well on real time queries (X) * OpenAI adds GPT-4o Finetuning (blog) * Google API updates - 1000 pages PDFs + LOTS of free tokens (X) * This weeks Buzz * Weights & Biases Judgement Day SF Hackathon in September 21-22 (Sign up to hack) * Video * Hotshot - new video model - trained by 4 guys (try it, technical deep dive) * Luma Dream Machine 1.5 (X, Try it) * Tools & Others * LMStudio 0.0.3 update - local RAG, structured outputs with any model & more (X) * Vercel - Vo now has chat (X) * Ark - a completely offline device - offline LLM + worlds maps (X) * Ricky's Daughter coding with cursor video is a must watch (video) The Best of the Best: Open Source Wins with Jamba, Phi 3.5, and Surprise Function Calling Heroes We kick things off this week by focusing on what we love the most on ThursdAI, open-source models! We had a ton of incredible releases this week, starting off with something we were super lucky to have live, the official announcement of AI21's latest LLM: Jamba. AI21 Officially Announces Jamba 1.5 Large/Mini – The Powerhouse Architecture Combines Transformer and Mamba While we've covered Jamba release on the show back in April, Jamba 1.5 is an updated powerhouse. It's 2 models, Large and Mini, both MoE and both are still hybrid architecture of Transformers + Mamba that try to get both worlds. Itay Dalmedigos, technical lead at AI21, joined us on the ThursdAI stage for an exclusive first look, giving us the full rundown on this developer-ready model with an awesome 256K context window, but it's not just the size – it’s about using that size effectively. AI21 measured the effective context use of their model on the new RULER benchmark released by NVIDIA, an iteration of the needle in the haystack and showed that their models have full utilization of context, as opposed to many other models. “As you mentioned, we’re able to pack many, many tokens on a single GPU. Uh, this is mostly due to the fact that we are able to quantize most of our parameters", Itay explained, diving into their secret sauce, ExpertsInt8, a novel quantization technique specifically designed for MoE models. Oh, and did we mention Jamba is multilingual (eight languages and counting), natively supports structured JSON, function calling, document digestion… basically everything developers dream of. They even chucked in citation generation, as it's long context can contain full documents, your RAG app may not

More Episodes

See all »

📆 ThursdAI - Nov 14 - Qwen 2.5 Coder, No Walls, Gemini 1114 👑 LLM, ChatGPT OS integrations & more AI news

This week is a very exciting one in the world of AI news, as we get 3 SOTA models, one in overall LLM rankings, on in OSS coding and one in OSS voice + a bunch of new breaking news during the show (which we reacted to live on the pod, and as we're now doing video, you can see us freak out in real...

Published 11/15/24

ThursdAI - The top AI news from the past week

Published 11/15/24

📆 ThursdAI - Nov 7 - Video version, full o1 was given and taken away, Anthropic price hike-u, halloween 💀 recap & more AI news

👋 Hey all, this is Alex, coming to you from the very Sunny California, as I'm in SF again, while there is a complete snow storm back home in Denver (brrr). I flew here for the Hackathon I kept telling you about, and it was glorious, we had over 400 registered, over 200 approved hackers, 21 teams...

Published 11/08/24