Skip to content
Change the repository type filter

All

    Repositories list

    • LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
      Python
      MIT License
      82000Updated Oct 4, 2024Oct 4, 2024
    • A copy of nanotron for multilingual training
      Python
      Apache License 2.0
      108003Updated Oct 4, 2024Oct 4, 2024
    • nanotron

      Public
      Minimalistic large language model 3D-parallelism training
      Python
      Apache License 2.0
      108715Updated Oct 1, 2024Oct 1, 2024
    • llm-proxy

      Public
      LLM Serving and User Control
      Python
      1000Updated Aug 30, 2024Aug 30, 2024
    • Containers for multimodal initiative (and maybe more across Swiss AI?)
      Dockerfile
      0000Updated Aug 7, 2024Aug 7, 2024
    • ml-4m-v2

      Public
      0000Updated Aug 5, 2024Aug 5, 2024
    • ml-4m

      Public
      4M: Massively Multimodal Masked Modeling (NeurIPS 2023 Spotlight)
      Python
      Apache License 2.0
      910134Updated Aug 5, 2024Aug 5, 2024
    • Easily create large video dataset from video urls
      Python
      MIT License
      65101Updated Aug 2, 2024Aug 2, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      4.1k000Updated Jul 31, 2024Jul 31, 2024
    • Tool set for data preparation and selection in the context of Swiss-AI (forked from DataTrove)
      Python
      Apache License 2.0
      138000Updated Jul 15, 2024Jul 15, 2024
    • PDF pipeline for creating training corpora (mainly for llm, multimodal and alignment horizontals)
      Python
      Apache License 2.0
      0000Updated May 8, 2024May 8, 2024
    • MoE

      Public
      some mixture of experts architecture implementations
      Python
      Apache License 2.0
      1500Updated Mar 22, 2024Mar 22, 2024
    • distributed trainer for LLMs
      Python
      Other
      76000Updated Feb 8, 2024Feb 8, 2024