The first real-time voice assistant
In the midst of the demos & discussion about OpenAI’s GPT-4o voice assistant, Kyutai swooped in to release the first real-time AI voice assistant model and a pretty slick demo (Moshi). Chris & Daniel discuss what this more open approach to a voice assistant might catalyze. They also discuss recent changes to Gartner’s ranking of GenAI on their hype cycle.
Matched from the episode's transcript 👇
Chris Benson: I think there’s certainly a chance at it, and I would argue – it’s the same argument I’ve made in previous shows where we talked on similar topics, is that we’re seeing… As the AI industry is has been maturing these years at an incredibly rapid pace, but we’re still seeing many of the things occurring that we saw when the software world was really maturing over several decades. And the place where open source has really, really worked are in common touchpoints where all organizations or many organizations need a common thing. And they might build something differentiated on top of that, for their revenue, to drive profitability… But there’s so much that is underneath that point of differentiation that they and many other organizations can get the benefit out of a lot of effort, a lot of work; a lot of times they’ll have paid employees do it…
So there’s a point where working together and doing open stuff makes sense for business, and it drives profitability. It may not be your single point of differentiation, but if it’s anything under that, why not? Why not share the costs, and pool expertise for the best possible foundations? And so what I’m hoping is that we continue to see that play out in the AI space. We’re seeing – if you look at Hugging Face, we’ve already talked about the fact that a couple of months ago they announced that they were hosting a million models; those are all open source. Really, really impressive. And so I think that there is a good chance that a vibrant, open community around AI can and will continue, and it will have a lot of corporate players involved in it. So I’m very optimistic in that way.
Break: [00:13:19.26]