Skip to content
View vjsrinivas's full-sized avatar

Highlights

  • Pro

Block or report vjsrinivas

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Jupyter Notebook 3,217 432 Updated Jun 12, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 37,000 4,584 Updated Aug 16, 2024

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,116 429 Updated Jan 9, 2025

[CVPR 2024] Rewrite the Stars

Python 339 16 Updated May 7, 2024

TensorFlow Lite Micro for Espressif Chipsets

C++ 416 95 Updated Jan 20, 2025

Optimised Neural Network functions for Espressif chipsets

Assembly 151 25 Updated Sep 16, 2024

Helper functions to create COCO datasets

Python 776 180 Updated Jun 20, 2024

This repo is an implementation of PyTorch version YOLOV Series

Python 345 49 Updated Dec 19, 2024

NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

Python 5,825 1,050 Updated Aug 8, 2024

Gradient based receptive field estimation for Convolutional Neural Networks

Jupyter Notebook 337 25 Updated Jan 11, 2020

[CVPR 2023] Code for PConv and FasterNet

Python 724 57 Updated May 16, 2023

Official implementation for DMNet: Density map guided object detection in aerial image (CVPR 2020 EarthVision workshop)

Python 100 15 Updated Jul 1, 2022

🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.

C++ 591 95 Updated Jan 20, 2025
Python 28 5 Updated Oct 24, 2024

[NeurlPS2024] One-Step Effective Diffusion Network for Real-World Image Super-Resolution

Python 301 21 Updated Dec 13, 2024

[TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"

Jupyter Notebook 128 4 Updated Nov 14, 2024

A very compact representation of a placeholder for an image.

C 16,012 368 Updated Jul 8, 2024

Zstandard - Fast real-time compression algorithm

C 24,161 2,165 Updated Jan 20, 2025

[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation

Python 5,511 461 Updated Sep 9, 2024

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Python 797 56 Updated Nov 22, 2024
Python 172 12 Updated Sep 26, 2024

[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 2…

C 822 134 Updated Nov 27, 2024

Implementations of few-shot object detection benchmarks

Python 1,125 225 Updated Nov 21, 2023

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Python 4,199 376 Updated Aug 1, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 15,322 1,435 Updated Jan 19, 2025

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 857 29 Updated Aug 9, 2024

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 2,797 224 Updated Jan 11, 2025

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Python 1,257 77 Updated Apr 18, 2024
Next
Showing results