Daniel & Chris sit down with Denis Yarats, Co-founder & CTO at Perplexity, to discuss Perplexity’s sophisticated AI-driven answer engine. Denis outlines some of the deficiencies in search engines, and how Perplexity’s approach to information retrieval improves on traditional search engine systems, with a focus on accuracy and validation of the information provided.
Matched from the episode's transcript 👇
Denis Yarats: Yeah, that’s definitely been very fascinating almost two years, I guess. I think in general web search – so what do you want to ultimately get as an answer to your question, right? So the current iteration that we have with Google and all the other classical search engines is an approximation of to get to this point, right? So you your question, you ask it as a formal query, and then you get a bunch of documents that are very relevant, but you have to still do additional work to get to the bottom of it. So you have to scan through the documents, you have to for yourself understand what “Is this a true answer? A not true answer?”, you have to trust the information. And as you can imagine, it’s a lot of work, especially if you’re trying to search for something very complicated, maybe things that are not so obvious.
And wouldn’t it be nice to avoid that step, where you ask the question and you get an answer right away. So that’s the ultimate destination where we’re trying to get. Obviously, it’s been tough to get there over the last decade or so, there has been a lot of work, but it never quite worked. There is this much higher level of hallucinations, much higher level of maybe not perfect synthesis of the information… You basically get a Frankenstein… So instead of a coherent and nice, easily parsable and readable answer, you get some just basically extracted pieces of the information and just concatenated together, so not very pleasant. And it’s funny that when we started – so one of our angel investor was Jeff Dean; he requires no introduction. And he was saying Google actually wanted to always build something this, but because they had such high expectations for accuracy, because millions and billions of users are using Google… And if you hallucinate 1% of the time, you’re gonna get a lot of unhappy people. And so they were never able to – because the models were not as strong as they are right now, they were never able to get to just 99.9% of accuracy. And that’s why like this work never panned out.
[00:06:20.00] But something great happened in 2022. When we started our company, both myself and Aravind, my co-founder, we come from academics, so we’ve been doing a lot of research in language modeling, reinforcement learning and stuff like that… And he was actually at Open AI at that time. We’ve been very literally following improvements of GPT models, GPT 2 and then GPT 3; that’s where it actually got very interesting. And it became obvious there was going to be something there. And this was primarily the motivation for us to start a company.
We wanted to build an answer engine from the get go, but it was very ambitious. I remember we would go to the investors and say “Oh, we’re going to build a search engine”, and they’re looking at you like you’re crazy, which makes a lot of sense. They’re just like “Oh, there’s Google ready.” And they had a fair point. But we still weren’t very discouraged by that. We knew there is something there, and we started prototyping.
So the first version of Perplexity we actually created as a slug board, or Discord bot, where it was a very primitive combination of a search engine plus - at that point it was DaVinci 2 models; it was still pre-ChatGPT. And it worked much better than I expected. It was very quick; we put up this demo in a couple of days. And you could already see that in certain cases it is very helpful.
Because this was a very early company, we were trying to hire one of our first engineers, and we didn’t know how to organize the insurance for him… And so we actually used this bot to ask those questions about insurance. Because if you go to Google and start asking the questions about insurance, you’re gonna get a lot of ads, and you’re just gonna get very quickly disappointed.