Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data Engineering for AI/ML - September 12, 2024 - Fully Virtual #38

Open
deepyaman opened this issue Jul 26, 2024 · 0 comments
Open

Data Engineering for AI/ML - September 12, 2024 - Fully Virtual #38

deepyaman opened this issue Jul 26, 2024 · 0 comments

Comments

@deepyaman
Copy link

https://home.mlops.community/public/events/dataengforai

Title

Building the Python-first composable analytics stack

Abstract

SQL has reigned king of the data transformation world, and tools like dbt have formed a cornerstone of the modern data stack. However, the rise of composable data systems combined with the emergence of key open-source technologies over the past few years gives data engineers the power to choose the right interface for them. Now, Ibis can provide the same benefits of SQL execution with a flexible Python dataframe API, and we can leverage it to build scalable Python pipelines in Kedro. dlt leverages the power of Apache Arrow for performant EL workflows, while Pandera (via the Ibis backend and Kedro-Pandera integration) provides fully-integrated data validation using the execution engine of your choice.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: backlog
Status: Submitted
Development

No branches or pull requests

1 participant