NSG with ADA-NNS

This repository is NSG with greedy search method (baseline) and ADA-NNS.

Please refer to original readme.

Building Instruction

Prerequisites

GCC 4.9+ with OpenMP
CMake 2.8+
Boost 1.55+
TCMalloc

IMPORTANT NOTE: this code uses AVX-256 intructions for fast distance computation, so your machine MUST support AVX-256 intructions, this can be checked using cat /proc/cpuinfo | grep avx2.

Compile On Ubuntu/Debian

Install Dependencies:

$ sudo apt-get install g++ cmake libboost-dev libgoogle-perftools-dev

Compile NSG:

$ git clone https://github.com/SNU-ARC/nsg
$ cd nsg/
$ git checkout ADA-NNS
$ cd build/
$ ./build.sh

Usage

We provide the script which can build and search for in memory-resident indices. The scripts locate under directory tests/. For the description of original main binaries, please refer to original readme.

Building NSG Index

To use NSG for ANNS, an NSG index must be built first. Here are the instructions for building NSG.

Step 1. Build kNN Graph

Firstly, we need to prepare a kNN graph.

NSG suggests to use efanna_graph to build this kNN graph. We provide the script to build kNN graphs for various datasets. Please refer to efanna_graph and checkout ADA-NNS branch.

You can also use any alternatives, such as Faiss.

The parameters used to build each graphs are as follows.

Dataset	K	L	iter	S	R
SIFT1M	200	200	10	10	100
GIST1M	400	400	12	15	100
CRAWL	400	420	12	15	100
DEEP1M	400	420	12	20	200
MSONG	200	200	10	10	100
GLOVE-100	400	420	12	20	200

The built graphs should be located in the directory nsg/build/tests/. as the following format.

e.g., sift1M_200nn.graph, gist1M_400nn.graph

Step 2. Build NSG index and search via NSG index

Secondly, we will convert the kNN graph to our NSG index and perform search.

The parameters used to build each indices are as follows.

Dataset	L	R	C
SIFT1M	40	50	500
GIST1M	60	70	500
CRAWL	150	50	1000
DEEP1M	200	40	1000
MSONG	40	50	500
GLOVE-100	60	70	500

Searching with NSG Index

Dataset should be located in the directory nsg/build/tests/. as the following format.

e.g., sift1M, gist1M

To use the greedy search, use the tests/evaluate_baseline.sh script:

$ cd tests/
$ ./evaluate_baseline.sh [dataset]

The argument is as follows:

(i) dataset: Name of the dataset. The script supports various real datasets (e.g., SIFT1M, GIST1M, CRAWL, DEEP1M, msong, glove-100).

To change parameter for search (e.g., K, L, number of threads), open evaluate_baseline.sh and modify the parameter K, L_SIZE, T.

To use the ADA-NNS, use the tests/evaluate_ADA-NNS.sh script:

$ cd tests/
$ ./evaluate_ADA-NNS.sh [dataset]

The arguments are same as above in evaluate_baseline.sh.

To change parameter for search (e.g., K, L, number of threads), open evaluate_ADA-NNS.sh and modify the parameter K, L_SIZE, T.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

NSG with ADA-NNS

Building Instruction

Prerequisites

Compile On Ubuntu/Debian

Usage

Building NSG Index

Step 1. Build kNN Graph

Step 2. Build NSG index and search via NSG index

Searching with NSG Index

Files

README.md

Latest commit

History

README.md

File metadata and controls

NSG with ADA-NNS

Building Instruction

Prerequisites

Compile On Ubuntu/Debian

Usage

Building NSG Index

Step 1. Build kNN Graph

Step 2. Build NSG index and search via NSG index

Searching with NSG Index