Skip to content
Change the repository type filter

All

    Repositories list

    • test_attn

      Public
      Testing and benchmarking different attention implementations and backends
      Python
      2300Updated Nov 22, 2024Nov 22, 2024
    • Triton kernels for Flux
      Python
      MIT License
      01700Updated Nov 9, 2024Nov 9, 2024
    • flux

      Public
      Official inference repo for FLUX.1 models
      Python
      Apache License 2.0
      1.3k601Updated Oct 21, 2024Oct 21, 2024
    • Optimize Flux on tinygrad
      Python
      MIT License
      1000Updated Oct 18, 2024Oct 18, 2024
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.4k000Updated Oct 10, 2024Oct 10, 2024
    • nexfort

      Public
      OneDiff compiler infrastructure using torch Inductor
      Python
      0100Updated Sep 26, 2024Sep 26, 2024
    • kernels

      Public
      Python
      17000Updated Sep 9, 2024Sep 9, 2024
    • xDiT

      Public
      xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters
      Python
      Apache License 2.0
      56000Updated Aug 30, 2024Aug 30, 2024
    • quanto

      Public
      Python
      0110Updated Aug 14, 2024Aug 14, 2024
    • Minimal implementation of scalable rectified flow transformers, based on SD3's approach
      Jupyter Notebook
      Apache License 2.0
      37500Updated Jul 27, 2024Jul 27, 2024
    • Python
      0000Updated Jul 25, 2024Jul 25, 2024
    • Cuda
      MIT License
      30000Updated Jul 11, 2024Jul 11, 2024
    • unet.cu

      Public
      UNet diffusion model in pure CUDA
      Cuda
      28000Updated Jun 28, 2024Jun 28, 2024