RecSys-for-CSD

Embedding tables are a crucial part of a recommendation system (RecSys), and today they are kept entirely in memory. As data volume keeps growing, memory capacity will eventually be exhausted. If the embedding tables were simply moved to storage without optimization, lookup latency would not meet the SLA requirement. Embedding lookup has been shown to account for a large portion of RecSys run time, and a lookup typically fetches only kilobytes of valid data from a table that is gigabytes in size. Offloading embedding lookups to the storage device eliminates the transfer of unwanted data.
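To make the "unwanted data" point concrete, here is a back-of-the-envelope sketch in Python. It compares the bytes moved when the host reads whole pages from an SSD with the bytes moved when the device returns only the requested embedding rows. The 4 KiB page size, the 16-dimensional float32 rows, and the 26 lookups per query are assumptions chosen for illustration, and the function names are hypothetical. The result is an idealized upper bound, not a measurement from this project.

```python
PAGE_SIZE = 4096  # bytes; assumed read granularity on the conventional SSD path


def host_read_bytes(num_lookups, page_size=PAGE_SIZE):
    """Worst case for a conventional SSD: every lookup hits a different
    page, and the whole page travels to the host."""
    return num_lookups * page_size


def device_return_bytes(num_lookups, dim, elem_size=4):
    """In-storage lookup: only the requested rows cross the bus."""
    return num_lookups * dim * elem_size


# Assumed workload: 26 lookups per query, 16 float32 values per row.
lookups, dim = 26, 16
host = host_read_bytes(lookups)
dev = device_return_bytes(lookups, dim)
print(f"host path: {host} B, in-storage path: {dev} B, ratio: {host / dev:.0f}x")
```

Page-cache hits, several rows sharing one page, and command overhead all narrow the gap in practice, so the measured reduction depends on the workload.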

Computational storage is applied to DLRM, the RecSys developed by Facebook. We integrate three customized computational storage device (CSD) platforms into DLRM at the same time for the end-to-end benchmark. On the Kaggle benchmark for DLRM, the evaluation achieves 1-2 ms per query. Compared with a conventional SSD, I/O traffic is reduced by 80% and energy consumption is reduced by 20%.

Steps to run test

1. Train the DLRM model on the dataset and save the model (only needs to be done once);
2. Convert the saved model into an embedding-only binary file (only needs to be done once);
3. Run the test for the SSD with a limited page cache and for the CSD, and monitor the I/O traffic.
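The conversion and file I/O steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's converter or test harness: the flat row-major float32 layout and the names `write_table`, `lookup`, and `sum_pool` are assumptions made for the example, and the binary format actually used by the project may differ.

```python
import struct


def write_table(path, rows):
    """Write one embedding table as a flat, row-major float32 file
    (a hypothetical layout chosen for this sketch)."""
    dim = len(rows[0])
    with open(path, "wb") as f:
        for row in rows:
            f.write(struct.pack(f"<{dim}f", *row))
    return dim


def lookup(path, indices, dim):
    """File I/O baseline: seek to each requested row and read only that row."""
    row_bytes = dim * 4  # float32
    vectors = []
    with open(path, "rb") as f:
        for idx in indices:
            f.seek(idx * row_bytes)
            vectors.append(struct.unpack(f"<{dim}f", f.read(row_bytes)))
    return vectors


def sum_pool(vectors):
    """Element-wise sum of the looked-up rows, as in a sum-pooled embedding bag."""
    return [sum(column) for column in zip(*vectors)]
```

For example, after `dim = write_table("emb.bin", table)`, calling `sum_pool(lookup("emb.bin", [3, 17, 42], dim))` returns one pooled vector for the three indices. Even though the application asks for single rows here, a conventional SSD still serves each read at block granularity underneath, which is the traffic that the third step monitors.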

BaiShuhan/RecSys-for-CSD
