
In this video, I dive into Moshi, a new open source model by a research lab in Paris called Kyutai. This innovative model competes with GPT-4's voice capabilities and is the first speech-in and speech-out open-source model Developed in just six months by a team of eight, Moshi supports 70 different emotions and styles, and allows for interaction with low latency. Link: https://kyutai.org 00:00 Introduction to Moshi: The Open Source GPT-4 Competitor 00:26 Live Demonstration of Moshi's Capabilities 01:50 Key Features and Technical Details of Moshi 03:22 Training and Deployment Insights 05:38 Future Prospects and Final Thoughts 06:56 Conclusion and Call to Action
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Weekly deep dives on AI agents, coding tools, and building with LLMs - delivered to your inbox.
Free forever. No spam.
Subscribe Free
New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.