A fan-made search engine for the WAN show podcast by LMG.
Written in Svelte.
Written in Golang using Gin.
Using a SQLite database and the driver by mattn.
The algorithm currently uses these techniques to rank documents. (in this order)
- Inverted Index
- Cosine similarity
- Word proximity
- Title weight (If the title includes a query term it is rated higher)
When retrieving relevent documents,
And uses a simple LIKE
sql statement in all_dialogues table for quoted search.
Frontend is hosted in Cloudflare pages.
The API is running in EC2. Uses Cloudflare proxy.
The bottleneck is the database, I tried indexing a bunch of stuff but this is as fast as i could make it. If anyone have any suggestions to make reads faster, I'm all ears.