Hello everyone. I want to share a new search tool I’ve been developing, which I hope will be a helpful resource. It allows you to find suttas by context using natural language. For example, if you can’t remember a sutta title but recall details like “The Buddha was speaking with ‘x’ person about ‘y’ topic,” you can still locate it easily. Or find sutta about a specific topic.
Basically, It uses vector embedding to map semantic “meaning” to each sutta, then your search query gets embedded to find a match.
Some of examples of search types that work well are things like:
“What is the sutta where the Buddha talks about de difference between a monk that lives in the forest and a monk that lives in a village?”
“The Buddha talks to King Pasenadi”
“The Buddha compares right effort to tuning a lute”
It’s still not perfect, but I plan to further break up the suttas into chunks for embedding so different themes don’t lost in longer suttas.
I’d like to thank Venerables Khemaratana and Khemarato who have been extremely helpful with feedback and suggestions so far. Any additional feedback is appreciated.
I agree though. I was also thinking about adding different filters or things like required keywords to make it easier to find specific phrases. I think there are a lot different directions it could go. My goal is to make it as widely applicable as possible.
That’s a good call out. You’re right that I did intend to use show the api url since it is technically the source, but I see how that’s pretty useless in hindsite . Maybe I’ll just remove the origin urls from the individual sutta pages altogether since I’m giving credit for the sources elsewhere on the site now as well.
I mean, it would be trivial to convert the link going to the live website? I mean sometimes it’s handy to have a straight link to the SC, to look for parallels, etc.