This course is an introductory course to help learners to understand some methods used in Exploratory Data Analysis EDA using Pandas. This course is intended for absolute beginners and the course assumes that learners are comfortable coding in the Jupyter notebook and have already installed the necessary IDE for programming in Python.
In this century of big data, it is considered a plus if an individual knows how to manipulate data and clean it, and come out with a story about the data and how you can get business insights from such data to improve business.
Pandas is a library popular for data manipulation and data analysis including numerical tabular data.
You will learn to use some basic methods to explore a COVID19 dataset downloaded from the World Health Organization website.
You will learn to use basic methods to describe your data, methods to insert new columns into your dataset, methods to identify duplicated values, methods to drop columns in the dataset, you will learn to do some mathematical computations on columns values such as sum or multiplying values of different columns to create or insert a new column and find null values.
The methods learned in this beginner course can serve as a foundation for which you will use for any other data analysis projects
It should be noted that the data visualization aspect of the course will be added to the course once they are being edited.
There are no reviews yet.