Collection of all things "Data Science" leaning towards Marketing & Advertisement (Digital Marketers, Agencies, Web Designers and Web Analysts)
- Software
- Libraries
- Learning
- Snowflake | Cloud Data Platform
- Spark | Distributed data processing framework for ML, analytics and more
- Databricks | Cloud platform for data engineering and data science by the makers of Spark (Community Edition available)
- Arc | Open-source Databricks alternative
- Supabase | open source Firebase alternative
- UKV | Replacing MongoDB, Neo4J, and Elastic with 1 transactional database. Features: zero-copy semantics, swappable backends, bindings for C, C++, Python, Java, GoLang
- Talend Open Studio
- Tibco Jaspersoft ETL
- Halzelcast Jet-start.sh - Distributed Streaming
- Mode | Interactive data science meets modern BI for fast, exploratory analysis company-wide
- DBT | Open-source tool to organize, cleanse, denormalize, filter, rename, and pre-aggregate raw data in warehouse for analysis.
- AirByte | Open-source ELT solution for simple data integration, owned by you
- PRQL | a modern language for transforming data — a simple, powerful, pipelined SQL replacement
- YoBulk | Opensource CSV importer powered by GPT3, flatfile alternative
- meltano | Extract & Load /with joy/ — CLI & version control for ELT without limitations
- Singer | Open-source composable data extraction for many sources and destinations
- Fivetran
- Stitchdata
- Panoply
- Electrik.ai | Extract Raw hit level Google Anayltics data
- OWOX | ETL specialized for Digital Marketing purposes
- Scitylana | Extract Raw hit level Google Analytics data
- Matillion | ETL & Transformation
- funnel.io | Marketing Data Extraction
- Adverity | Marketing Data Integration, Reporting and Analytics
- Metacat | Open-source Metadata management for Hive, RDS, Teradata, Redshift, S3 and Cassandra
- Amundsen Frontend Service | Open-source Metadata indexing for tables, dashboards, streams, etc. with page-rank style search
- TimescaleDB | open-source database for scalable SQL time-series based on PostgreSQL
- EventNative | open source, high-performance, event collection service
- Apache Ignite | open-source in-memory distributed database, caching, and processing platform for transactional, analytical, and streaming workloads
- Redis | open source in-memory advanced key-value store used as a database, cache and message broker
- Apache Kafka | Open-source distributed event streaming platform for high-performance data pipelines, streaming analytics, data integration
- InfluxDB | open-source time series database
- ClickHouse | open-source column oriented OLAP DBMS for realtime querying in SQL
- RQLite | open-source Lightweight, distributed SQLite database handling leader elections, tolerates failures of machines, including leader available for Linux, macOS, and Microsoft Windows
- DuckDB | open-source fast in-process SQL OLAP database
- Qdrant Vector Database for AI applications
- Datomic recently free (apache 2.0), fully transactional, cloud-ready, distributed database
- XTDB a bitemporal and dynamic relational database for SQL and Datalog
- undb open source, private first, self-hosted, no code database as an airtable alternative
- apitable an API-oriented low-code platform for building collaborative apps and better than all other Airtable open-source alternatives
- baserow an open source no-code database tool and Airtable alternative.
- Segment | Marketing CDP
- Ascent360 | Full-stack CEP/CDP
- SAP Emarsys | Full-stack CDP
- Firsthive | Marketing CDP to take control of first-party data from online and offline, and enable personalized campaigns
- Dynamics 365 Customer Insights | Full-stack CDP
- Piwik.pro | Marketing CDP
- Salesforce Customer Interaction | Full-stack CDP
- Tealium Audience Stream | Marketing CDP
- Treasuredata | Full-stack CDP
- Blueshift | Full-stack & Marketing CDP
- Exponea | Marketing CDP
- Rudderstack | Paid Customer Data Pipeline for Event streaming, Warehouse sync and ETL, open-source community edition available
- Apache Airflow | Workflow scheduler using Directed Acyclical Graphs in Python
- Apache Oozie | Workflow scheduler to manage Hadoop jobs as Directed Acyclical Graphs in Java and XML
- Luigi | open-source pipeline and batch job management
- ActivePieces | Open Source Zapier alternative for business automation
- Automatisch | Open SOurce Zapier alternative for business automation
- Mixpanel | self-serve product analytics to help you convert, engage, and retain more users
- Amplitude | Digital product and user analytics platform
- Snowplow | Open source Web, mobile and event analytics for AWS and GCP
- RRWeb | Open source web session recorder & player for user behaviour analysis
- Plausible Analytics | Open-source lightweight, privacy respecting cookie-less Google Analytics alternative
- Simple Analytics | Paid cookie-less, privacy respecting Google Analytics alternative
- Fathom Lite | Open-source self hosted Google Analytics alternative
- Shynet | Open-source cookie-less Google Analytics alternative
- Friendly Analytics | Paid privacy respecting Matomo/Piwik fork
- Google Analytics 4 | Cookie & Cookie-less App & Web Analytics platform with flexible event model and tight Google Marketing Cloud integration
- Cloudflare Web Analytics | Free cookie-less privacy respecting Google Analytics alternative
- Panelbear | Paid Cookie-less Google Analytics alternative
- Open Web Analytics | Open-source Google Analytics alternative
- GoAccess | Open-source realtime web-log analytics for terminal and web
- Matomo | (Formerly Piwik) Open-source Google Analytics alternative
- Analytics | Lightweight Open-source abstraction layer for web analytics and marketing tracking
- Umami | Open-source, light weight Google Analytics alternative
- Google Analytics Beacon | A proxy for Universal Analytics allows to track via image pixel, when no javascript is allowed
- Getinsights | Privacy-focused, cookie free analytics, free for up to 5k events/month.
- Keen.io | Managed Event Streaming Platform, built on Kafka, Storm, and Cassandra
- Quantcast Analytics | Audience Analytics with 3rd party data exchange
- Engauge | Open-source single binary app and web analytics with minimal requirements
- Counter.dev | Basic, open source non-intrusive web analytics
- Beam Analytics | Paid, closed source, GDPR compliant, cookieless, 100k pageviews/m unrestricted free tier
- Uptrace | Open source APM: OpenTelemetry traces, metrics, and logs
- Microanalytics | GDPR compliant simple paid web analytics hosted in the EU
- KNIME Analytics | open source software for creating visual data science workflows
- Jupyter Hub / Lab / Notebooks | Open source single & multiuser IDE for data science and machine learning supporting Python and more
- Knowage | Open source BI and Analytics suite
- Microsoft R Open | open source platform for statistical analysis and data science
- Orange3 | Open source machine learning and data visualization and visual data analysis workflows
- RanalyticFlow | Open source data analysis software built on R, for interactive data analysis with or without R programming
- Rapidminer Studio | data science platform that unites data prep, machine learning & predictive model deployment
- RStudio | Desktop and cloud based single or collaborative IDE for R
- Tibco Spotfire Analytics | AI-powered, search-driven experience with built-in data wrangling and advanced analytics
- Power BI
- Tableau
- Pentaho Business Analytics
- Panoply | Full service Analytics Stack
- QlikSense | Assisted BI Analytics suite
- Klipfolio | metrics, meaningful dashboards, and actionable reports
- Geckoboard | Self service dashboards for various Data Sources
- Google Data Studio | Free dashboard and reporting service focused on marketing data sources
- Sisense | BI Analytics suite
- Looker | BI Analytics suite
- Canopy.cloud | Reporting and Analytics for Investors and Wealth managers
- Chartio | Dashboard and Reporting platform
- Cyfe
- Metabase | Open-source data query and visualization for non-tech people
- Redash | Open-source data query and visualization for non-tech people
- Looker | Google acquired data visualization
- Apache Grafana | Open-source data visualization and monitoring
- Apache Superset | Open-source enterprise grade web based BI solution
- evidence | Business intelligence as code with SQL and markdown
- datasette exploring and publishing data of any shape or size using an encapsulated website and API, especially useful for data journalism
- Python in Excel Microsoft's official Python support within Excel
- AgencyAnalytics
- WhatGraph
- Reportz.io
- Swydo
- Megalytic
- Octoboard
- ReportingNinja
- SuperDash
- ReportGarden
- DashThis
- dstack - an open-source framework for building data science applications using Python and R
- Analytics Zoo | Unified Data Analytics and AI stack with TensorFlow, Keras and Pytorch and seamless deployment
- EasySpider visual code-free/no-code web crawler/spider
- Plot JavaScript library for exploratory data visualization Create expressive charts with concise code
- Tremor React library to build dashboards fast
- apexcharts.js
- AutoViz, Automatically Visualize any dataset, any size with a single line of code.
- PyGWalker | TUrn pandas dataframe into Tableau-style UI for Data Analysis
- MLBox | powerful Automated Machine Learning python library
- PyCaret | open source low-code machine learning library in Python
- PyTorch Lightning | open-source Python library as a high-level interface for PyTorch
- igel | machine learning tool to train/fit, test and use models without writing code
- Ludwig | toolbox to train and evaluate deep learning models without writing code built on TensorFlow
- fast.ai | training fast and accurate neural nets using modern best practices
- Apache Arrow | Open-source cross-language development platform for in-memory analytics
- auto-sklearn | Automated Machine Learning for Python
- Google Ads Performance Pipeline | Data integration pipeline from GAds to PostgreSQL
- Text-Mining-Search-Query | segements search terms to search words and summarizes performance metrics
- CRMint | reliable data integration and processing for advertisers
- Official Google Ads Python library | client library for Google Ads API
- Official Google Ads Python library | SOAP Ads APIs for AdWords and DoubleClick for Publishers
- GAQL CLI| Running GoogleAds queries
- advertools | SEM, SEO, Social productivity & analysis tools to scale your online marketing
- pyaw-reporting | AdWords API large scale reporting tool written in Python
- Typesense | fast, small, typo-tolerant search engine, when Elasticsearch is too big
- Elasticsearch | distributed RESTful search engine built for the cloud
- Lucene Solr | enterprise search platform written in Java and using Apache Lucene
- Time-Series-Library A Library for Advanced Deep Time Series Models with iTransformers performing SOTA forecasting with inverted transformers
- neuralforecast formerly TimeGPT, Scalable and user friendly neural forecasting algorithms
- Andrew Ng, Coursera, 54h Machine Learning
- Herbert Lee, Coursera, 10h Bayesian Statistics: From Concept to Data Analysis
- Jeff Leek, Coursera/John Hopkins University, 300h Data Science Specialization
- CS109 Data Science, Harward CS109
- Rafael Irizarry, Michael Love, Hardward/edX, 16h Statistics and R
- Dave Holtz, Cheng-Han Lee, Udacity Intro to Data Science
- Christopher Brooks, University of Michigan, 120h Applied Data Science with Python Specialization
- Saeed Aghabozorgi, IBM, 20h Machine Learning with Python
- Jose Portilla, Udemy, 25h Python for Data Science and Machine Learning Bootcamp
- https://www.udemy.com/course/datascience
- Learnpython Learnpython.org
- W3 Schools SQL
- Bayesian Methods for Hackers
- Naked Statistics: Stripping the Dread from the Data
- How to lie with statistics
- An Introduction to Statistical Learning: with Applications in R
- Naked Statistics: Stripping the Dread from the Data
- Effective Data Visualization: The Right Chart for the Right Data
- Data-Driven Storytelling (AK Peters Visualization Series
- Making Data Visual: A Practical Guide to Using Visualization for Insights
- Data Visualisation: A Handbook for Data Driven Design
- Storytelling with Data: A Data Visualization Guide for Business Professionals
- Fundamentals of Data Visualization: A Primer on Making Informative and Compelling Figures
- The Art of Data Science
- Python: Data Analytics and Visualization
- Dash for Python | Build web analytics application in Python in a few hours
- Python Data Science Handbook
- Big Data Science & Analytics: A Hands-On Approach
- Data Science from Scratch: First Principles with Python
- Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are