Skip to content

GuillePrograma94/python-pandas-tutorial

 
 

Repository files navigation

Pandas for machine learning

Pandas is the best and most popular Python library for machine learning. This library offers a wide variety of functions that will help you manipulate data, optimize your machine-learning algorithm, and much more. This tutorial will help you to get familiar with this library and master the most used functionalities with code samples and video tutorials that will help you to create your first data frame, clean a dataset of information, read CSV files, etc...

The exercises in this tutorial have been created after about 60 hours of development by many experts in machine learning and carefully reviewed by our contributors to make sure you have the most accurate and important information that will help you start your machine learning career.

Content table

In this tutorial, we will see the most important and basic functions provided by Pandas that will help you work with data in machine learning. The following are some of the topics that will be covered in this tutorial.

Exercise Description of the topic
Install Pandas These exercises cover how to install Pandas, how to import the Pandas library in a Python file, and how to create your first Python script.
DataSets These exercises explain what datasets are and how to work with them.
Series These exercises explain what Series are in Pandas and how to use them.
DataFrames These exercises explain how to create an information DataFrame and what functions can be used to work with them.
Clean DataSets This class covers what data cleaning is, the functions Pandas offers to clean up a dataset, and the best practices to use when cleaning a dataset.

Tutorial Installation

There are two ways to initialize this tutorial, the first and easiest is to open the tutorial in a cloud environment such as Codespaces or Gitpod, and the second is to clone this repository in your local environment.

We recommend that you use Codespaces because it is the easiest and fastest way to start the tutorial.

1. Open the tutorial in a cloud environment

You can start this tutorial in just a few seconds with Codespaces by clicking on the following link: open in codespaces (recommended), or you can use Gitpod by clicking on: open in gitpod.

Once you have opened the cloud environment, either Codespaces or Gitpod, the LearnPack exercises should start automatically. If the exercises do not start automatically you can open a terminal and type the command: learnpack start

2. Open the tutorial in your local environment

To start this tutorial in your local environment, follow the steps below:

  1. Open a terminal and clone this repository on your local environment. You can use the following command:
git clone https://github.com/4GeeksAcademy/python-pandas-tutorial.git 
  1. Make sure you have installed a Node.js version of 12.01.1 or higher:
node --version
  1. Install LearnPack, the package manager for learning tutorials, and also run the Python compiler plugin for LearnPack. You can do this with the following commands:
> npm i learnpack -g
> learnpack plugins:install learnpack-python
  1. Finally, Install Jest to perform the necessary tests throughout the tutorial and start the exercises with the following commands:
> npm i [email protected] -g
> learnpack start

Contributors

We would like to express our deepest gratitude to the following contributors for their invaluable support in making this tutorial possible.

Contributor GitHub account
Alejandro Sanchez alesanchezr
Martín Suárez kiddopro
Lorena Gubaira Lorenagubaira
Tomas Gonzalez tommygonzaleza
Hernán García hernanjkd
Ernesto Gonzalez UmiKami
Hector Chocobar hchocobar
Charly Chacón Charlytoc
Agustín Fernández Dasher83
Ignacio Cordoba nachovz

This tutorial and many other exercises are designed for students as part of the 4Geeks Academy's Coding Bootcamp. Currently, we have two courses available. The first one is the Full Stack Developer Course. In this course, you will learn technologies like HTML5, CSS3, JavaScript, Python, Flask, SQL and many others. The second one is the Data Science Bootcamp, where you will learn technologies like Python, Algorithms' basics, Pandas, SQL Database, and many other technologies. You can find more information about these courses and the upcoming Blockchain and Web3 course on the official 4Geeks Academy web page.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%