
In this video I show you the new multimodal model by Meta AI that allows you to input speech or text and get a host of different outputs from speech-to-speech translation to speech-to-text translation and more. Links: https://ai.meta.com/blog/seamless-m4t/ https://seamless.metademolab.com/demo https://github.com/facebookresearch/seamless_communication https://dl.fbaipublicfiles.com/seamless/seamless_m4t_paper.pdf https://huggingface.co/spaces/facebook/seamless_m4t
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Weekly deep dives on AI agents, coding tools, and building with LLMs - delivered to your inbox.
Free forever. No spam.
Subscribe Free
New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.