Big Data Processing

  • When one machine is not enough to process the data, we divide and conquer → this, essentially, is Big Data Processing
  • Companies use this to process massive amounts of data and extract insights from it, train ML models, move data across databases, and much more
  • When too much data needs to be processed quickly, we can use big data tools
  • All of this processing runs on commodity hardware (no specialized hardware required)

Distributed Computing

More computers, more CPUs, more processing power.

  • ‘Split’ the file into ‘partitions’
  • Distribute the partitions across all the servers
  • Let each server compute the word frequencies independently
  • Send the word frequencies to one server (the coordinator), which merges them (see the sketch below)
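The same split/count/merge idea can be sketched in plain Python. This is a single-machine illustration, not a distributed implementation; the sample text and partition count are made up for the example.

```python
# Minimal single-machine sketch of the split -> count -> merge idea.
from collections import Counter

def split_into_partitions(words, num_partitions):
    """'Split' the word list into roughly equal partitions."""
    size = max(1, len(words) // num_partitions)
    return [words[i:i + size] for i in range(0, len(words), size)]

def count_partition(partition):
    """Each 'server' counts word frequencies in its own partition."""
    return Counter(partition)

def merge_counts(partial_counts):
    """The 'coordinator' merges the per-partition frequencies."""
    total = Counter()
    for counts in partial_counts:
        total.update(counts)
    return total

words = "the quick brown fox jumps over the lazy dog the end".split()
partitions = split_into_partitions(words, num_partitions=3)
partials = [count_partition(p) for p in partitions]
print(merge_counts(partials))
```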

Flow

(Diagram: flow demonstration)

  1. The user submits the job to the ‘coordinator’
  2. The coordinator distributes the job across multiple machines
  3. The machines (workers) compute and send their results to the coordinator
  4. The coordinator merges the results and returns them (a sketch follows this list)
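A minimal sketch of this coordinator/worker flow, using Python's concurrent.futures with local processes standing in for the worker machines; the partitions and merge logic are illustrative assumptions, not part of any specific framework.

```python
# Coordinator/worker flow with local processes as stand-in 'worker machines'.
from collections import Counter
from concurrent.futures import ProcessPoolExecutor

def worker_count(partition):
    # Step 3: each worker computes its piece independently.
    return Counter(partition)

def coordinator(partitions):
    # Steps 1-2: the job arrives here and is distributed to the workers.
    with ProcessPoolExecutor() as pool:
        partial_results = pool.map(worker_count, partitions)
    # Step 4: the coordinator merges the partial results and returns.
    total = Counter()
    for partial in partial_results:
        total.update(partial)
    return total

if __name__ == "__main__":
    partitions = [["a", "b", "a"], ["b", "c"], ["a", "c", "c"]]
    print(coordinator(partitions))
```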

Challenges

  • What about failures?
  • What about recovery?
  • What about completion?
  • What about scaling & distribution?

So how are these handled? That’s what Big Data tools are for.


Spark does large-scale data processing on commodity hardware; it has connectors to a lot of databases and infra components.

Spark Operation

e.g., combine the user, order, payments, and logistics DBs and put the result in AWS Redshift (see the sketch below)
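A sketch of what such a job might look like in PySpark, assuming JDBC connectivity to the source databases and to Redshift (the JDBC drivers must be on the classpath). All URLs, table names, join keys, and credentials below are placeholders; a dedicated Redshift connector could be used instead of plain JDBC.

```python
# Sketch: join several operational DBs and load the result into Redshift.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("combine-and-load").getOrCreate()

def read_table(url, table):
    # Read one table over JDBC; Postgres/MySQL/etc. work the same way.
    return (spark.read.format("jdbc")
            .option("url", url)
            .option("dbtable", table)
            .option("user", "reader")      # placeholder credentials
            .option("password", "secret")
            .load())

users     = read_table("jdbc:postgresql://users-db:5432/app", "users")
orders    = read_table("jdbc:postgresql://orders-db:5432/app", "orders")
payments  = read_table("jdbc:postgresql://payments-db:5432/app", "payments")
logistics = read_table("jdbc:postgresql://logistics-db:5432/app", "logistics")

# Join the four sources into one denormalised view (join keys are assumed).
combined = (users.join(orders, "user_id")
                 .join(payments, "order_id")
                 .join(logistics, "order_id"))

# Write the result to Redshift over JDBC (placeholder cluster URL and table).
(combined.write.format("jdbc")
    .option("url", "jdbc:redshift://example.redshift.amazonaws.com:5439/dw")
    .option("dbtable", "analytics.user_orders")
    .option("user", "loader")
    .option("password", "secret")
    .mode("overwrite")
    .save())
```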