Skip to content

πŸ€– Gemini AI Chat + πŸ”„ Real-time WebSocket + πŸ–ΌοΈ Multimodal = Your next-gen chat app! Built with Google's Gemini AI for seamless text & image interactions in real-time. Fast, secure, and ready to deploy! ⚑️

Notifications You must be signed in to change notification settings

jadouse5/gemini-realtime-multimodal

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ€– Gemini Realtime Multimodal Chat

A real-time multimodal chat application powered by Google's Gemini AI. This project enables real-time communication with Gemini's advanced AI capabilities through WebSocket connections, supporting both text and multimodal interactions.

View on GitHub


✨ Features

  • 🎯 Real-time Gemini AI Integration
  • πŸ–ΌοΈ Multimodal Support (Text + Images)
  • πŸ”„ WebSocket-based Real-time Communication
  • πŸ” Secure Environment Configuration
  • ⚑ Async/Await Implementation
  • πŸ“± Cross-platform Support

πŸ› οΈ Requirements

google-genai==0.2.2
websockets
python-dotenv

πŸš€ Getting Started

  1. Clone the repository:

    git clone https://github.com/jadouse5/gemini-realtime-multimodal.git
    cd gemini-realtime-multimodal
  2. Set up virtual environment:

    python -m venv venv
    source venv/bin/activate  # On Windows: venv\Scripts\activate
  3. Install dependencies:

    pip install -r requirements.txt
  4. Configure environment: Create .env file:

    GOOGLE_API_KEY=your_gemini_api_key_here

πŸ’» Usage

  1. Start the server:

    python main.py
  2. Connect to WebSocket:

python -m http.server 8000

Connect using your preferred WebSocket client


πŸ”§ Configuration Options

  • GOOGLE_API_KEY: Your Gemini API key
  • PORT: WebSocket server port (default: 8765)
  • HOST: WebSocket server host (default: localhost)

🀝 Contributing

Contributions are welcome! To contribute:

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit changes (git commit -m 'Add AmazingFeature')
  4. Push to branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

πŸ“ License

This project is licensed under the MIT License - see the LICENSE file for details.


πŸ‘€ Author

Jad Tounsi


πŸ™ Acknowledgments

  • Google Gemini AI team
  • WebSocket protocol contributors
  • Python async/await community

πŸ“š Documentation

For detailed documentation on the Gemini API, visit:


Made with ❀️ and Python

This README has been updated to specifically match your repository at jadouse5/gemini-realtime-multimodal and includes relevant sections for a multimodal AI chat application using Gemini.

About

πŸ€– Gemini AI Chat + πŸ”„ Real-time WebSocket + πŸ–ΌοΈ Multimodal = Your next-gen chat app! Built with Google's Gemini AI for seamless text & image interactions in real-time. Fast, secure, and ready to deploy! ⚑️

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published