66% OFF OFFER Dell Ms116 275-BBCB Optical w ired Mouse at just 219/- Data science consulting companies are a hot choice if you’re looking for a job in the field. They offer numerous development opportunities, access to the latest technologies, and provide data-based solutions for top-notch companies across the globe. Furthermore, on top of generous salaries, they seem to have tons of cool perks – from unlimited vacation days and free meals to hair salons and masseuses on site. This doesn’t make your choice any simpler, though. With so many industries and companies out there, it’s hard to keep track of who-offers-what-and-where. So, watch this video to find out which companies provide the best overall employee experience in 2020! Freshers for Data Science Roles: Mu Sigma, Fractal Analytics, Exponentia, Clover Infotech,...
Get link
Facebook
X
Pinterest
Email
Other Apps
pandas
Get link
Facebook
X
Pinterest
Email
Other Apps
What is Pandas python and installation of Pandas..? pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool,built on top of the Python programming language.
Installation:
Dataframe Basics:
DataFrame is a main object of pandas. It is used to represent tabular data (with rows and columns).
1) What is dataframe?
Pandas DataFrame is two-dimensional size-mutable, potentially heterogeneous tabular data structure with labeled axes (rows and columns). A Data frame is a two-dimensional data structure, i.e., data is aligned in a tabular fashion in rows and columns. Pandas DataFrame consists of three principal components, the data, rows, and columns.
2) Create dataframe from csv file and python dictionary
3) Dealing with rows and columns
4) Operations: mean, max, std, describe
5) Conditional selection
6) set_index function and usefulness of it
Different Ways of Creating Data frames:
Create dataframe using read_csv() method
Create dataframe using read_excel() method
Create dataframe using python dictionary DataFrame() method
Create dataframe using tuples list DataFrame() methodCreate dataframe using the list of dictionary DataFrame() method
Read Write Excel csv files in Pandas:
1) Different options on cleaning up messy data while reading csv/excel files
2) Use convertors to transform data read from excel file
3) Export only portion of dataframe to excel file
Topics covered:
Read CSV file using read_csv() method
Skip rows in dataframe using "skiprows"
Import data from CSV file with "null header"
Read limited data from CSV file
Clean up messy data from file "not available" and "n.a." replace with "na_values"
Supply dictionary for replace with "na_values"
Write dataframe into "csv" file with "to_csv() method"
Read excel file using read_excel() method
Converters argument in read_excel() method
Write dataframe into "excel" file with "to_excel() method"
Use ExcelWritter() class
All properties for Read Write Excel CSV File
How to handle Missing Datas:
to handle missing data in pandas using fillna, interpolate and dropna methods. You can fill missing values using a value or list of values or use one of the interpolation methods.
Convert string column into the date type
Use date as an index of dataframe usine set_index() method
Use fillna() method in dataframe
Use fillna(method="ffill") method in dataframe
Use fillna(method="bfill") method in dataframe
"axis" parameter in fillna() method in dataframe
"limit" parameter in fillna() method in dataframe
interpolate() to do interpolation in dataframe
interpolate() method "time"
dropna() method Drop all the rows which has "na" in dataframe
"how" parameter in dropna() method
"thresh" parameter in dropna() method
Handle Missing datas - Replace function:
replace method can be used to replace specific values with some other values. It supports replacement using single value, a list, a regular expression and a dictionary. Often times you get data in one form and want to transform data into some other form as far as values are concerned. At this time replace method can be used to perform transformation.
How to use replace method to deal with missing data?
How to handle special values in data?
Use replace() method to replace values in dataframe
Replace values using a dictionary
How "regex" (regular expression) works
Replace data with "regex" using a dictionary
Replace the list of values with another list of values
GroupBy Method:
groupby method can be used to group your dataset based on some criteria and then apply analytics on each of the groups. This is similar to SQL group by. It is also called split apply combine strategy in data science.
Use groupby() method
groupby() representation internally
What is split apply combine?
Use describe() function in groupby
Concat dataframes:
concat function to join or append dataframes.
What is concat?
Concat two dataframe using concat() function
ignore_index argument in concat() function
List of arguments for concat() function
What is "keys"? pass "keys" to concat() function
"axis" argument in concat() function
Join dataframe with series() function
66% OFF OFFER Dell Ms116 275-BBCB Optical w ired Mouse at just 219/- Data science consulting companies are a hot choice if you’re looking for a job in the field. They offer numerous development opportunities, access to the latest technologies, and provide data-based solutions for top-notch companies across the globe. Furthermore, on top of generous salaries, they seem to have tons of cool perks – from unlimited vacation days and free meals to hair salons and masseuses on site. This doesn’t make your choice any simpler, though. With so many industries and companies out there, it’s hard to keep track of who-offers-what-and-where. So, watch this video to find out which companies provide the best overall employee experience in 2020! Freshers for Data Science Roles: Mu Sigma, Fractal Analytics, Exponentia, Clover Infotech,...
SQL Basics: Relational Databases A relational database is a database that stores related information across multiple tables and allows you to query information in more than one table at the same time. It's easier to understand how this works by thinking through an example. Imagine you're a business and you want to keep track of your sales information. You could set up a spreadsheet in Excel with all of the information you want to keep track of as separate columns: Order number, date, amount due, shipment tracking number, customer name, customer address, and customer phone number. This setup would work fine for tracking the information you need to begin with, but as you start to get repeat orders from the same customer you'll find that their name, address and phone number gets stored in multiple rows of your spreadsheet. As your business grows and the number of orders you're tracking increases, this redundant data will take up unnecessary space and generally ...
Comments
Post a Comment