Skip to content
@KubedAI

Kube-dAI

All about Data and AI on Kubernetes

Kube-dAI

Kube (Kubernetes) and dAI( Data and AI)

Kube-dAI: Scalable Data and AI Deployment Patterns on Kubernetes

Welcome to Kube-dAI, an open-source project dedicated to providing highly scalable deployment patterns and infrastructure-as-code templates for data processing and AI workloads on Kubernetes.

๐Ÿš€ About Kube-dAI

Kube-dAI aims to empower organizations with battle-tested, scalable architectures for running data and AI workloads on Kubernetes. Our project offers:

  • Highly scalable deployment patterns
  • Best practices for compute and storage optimization
  • Comprehensive benchmarks for performance evaluation
  • Infrastructure-as-code templates for rapid deployment

Whether you're orchestrating large-scale data processing pipelines or deploying cutting-edge AI models, Kube-dAI provides the insights and tools you need to maximize your Kubernetes infrastructure.

๐Ÿ” Project Focus

Our project delves deep into compute and storage best practices for each deployment, whether it's data processing or generative AI inference. We cover two main categories:

Data Processing

  • Apache Spark
  • Apache Flink
  • Trino (formerly PrestoSQL)

Artificial Intelligence

  • KServe
  • Ray
  • NVIDIA Triton Inference Server
  • LWS (Leader Worker Set)

For each technology, we provide:

  • Scalable architecture designs
  • Performance optimization techniques
  • Resource allocation strategies
  • Storage configuration best practices
  • Comprehensive benchmarks

๐Ÿ“Š Benchmarks and Best Practices

Each repository includes detailed benchmarks and best practices, covering:

  • Scalability tests (from gigabytes to petabytes)
  • Throughput and latency measurements
  • Resource utilization optimizations
  • Cost-efficiency analyses
  • Comparative studies of storage solutions
  • Best practices for specific workload types

๐Ÿ“‚ Repository Structure

Our GitHub organization hosts multiple repositories, each dedicated to a specific technology:

Data Processing:

  • spark-on-k8s: Apache Spark deployment patterns and benchmarks
  • flink-on-k8s: Apache Flink deployment patterns and benchmarks
  • trino-on-k8s: Trino deployment patterns and benchmarks

Artificial Intelligence:

  • kserve-on-k8s: KServe deployment patterns and benchmarks
  • ray-on-k8s: Ray deployment patterns and benchmarks
  • triton-on-k8s: NVIDIA Triton Inference Server deployment patterns and benchmarks
  • lws-on-k8s: LWS deployment patterns and benchmarks

Additional Repositories:

  • genai-inference-patterns: Generative AI inference optimization techniques
  • data-storage-benchmarks: Comparative studies of storage solutions for big data
  • ai-model-serving-best-practices: Best practices for serving AI models at scale

๐Ÿ’ป Getting Started

To leverage Kube-dAI for your projects:

  1. Explore our repositories to find relevant technologies and patterns.
  2. Clone the repository of interest:
    git clone https://github.com/Kube-dAI/<repository-name>.git
    
  3. Follow the detailed README in each repository for setup, benchmarking, and best practices.

๐Ÿค Contributing

We welcome contributions from the community! To contribute:

  1. Fork the relevant repository.
  2. Create a new branch for your feature, benchmark, or optimization.
  3. Submit a pull request with your improvements.

Please review our Contribution Guidelines for more details.

๐Ÿ“œ License

This project is open-source and available under the Apache License 2.0.

๐Ÿ“ž Support

For assistance or to discuss advanced use cases:

  • Open an issue in the relevant repository
  • Join our community Slack (placeholder link)
  • Explore our comprehensive documentation (placeholder link)

๐Ÿ”— Useful Links


Kube-dAI: Empowering Scalable Data and AI Deployments on Kubernetes

Popular repositories Loading

  1. spark-history-server spark-history-server Public

    Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs

    Shell 16 9

  2. .github .github Public

  3. airflow-dags airflow-dags Public

    Sample DAGs repo to use with Apache Airflow GitSync feature.

    Python 6

Repositories

Showing 3 of 3 repositories
  • spark-history-server Public

    Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs

    KubedAI/spark-history-serverโ€™s past year of commit activity
    Shell 16 Apache-2.0 9 5 0 Updated Sep 19, 2024
  • .github Public
    KubedAI/.githubโ€™s past year of commit activity
    0 0 0 0 Updated Sep 7, 2024
  • airflow-dags Public

    Sample DAGs repo to use with Apache Airflow GitSync feature.

    KubedAI/airflow-dagsโ€™s past year of commit activity
    Python 0 6 0 0 Updated Jun 28, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loadingโ€ฆ

Most used topics

Loadingโ€ฆ