Skip to content

Latest commit

 

History

History
43 lines (23 loc) · 1.16 KB

README.md

File metadata and controls

43 lines (23 loc) · 1.16 KB

WANsearch 🔍

A fan-made search engine for the WAN show podcast by LMG.

Frontend

Written in Svelte.

Backend

Written in Golang using Gin.

Database

Using a SQLite database and the driver by mattn.

More info.

How it works

The algorithm currently uses these techniques to rank documents. (in this order)

  • Inverted Index
  • Cosine similarity
  • Word proximity
  • Title weight (If the title includes a query term it is rated higher)

When retrieving relevent documents,

And uses a simple LIKE sql statement in all_dialogues table for quoted search.

Infrastucture

Frontend is hosted in Cloudflare pages.

The API is running in EC2. Uses Cloudflare proxy.

Know issues

Speed when searching by relevance

The bottleneck is the database, I tried indexing a bunch of stuff but this is as fast as i could make it. If anyone have any suggestions to make reads faster, I'm all ears.