Dylan  Iqbal

Dylan Iqbal

1556598660

Pandas Cheat Sheet for Data Science in Python

A quick guide to the basics of the Python data analysis library Pandas, including code samples.

The Pandas library is one of the most preferred tools for data scientists to do data manipulation and analysis, next to matplotlib for data visualization and NumPy, the fundamental library for scientific computing in Python on which Pandas was built.

The fast, flexible, and expressive Pandas data structures are designed to make real-world data analysis significantly easier, but this might not be immediately the case for those who are just getting started with it. Exactly because there is so much functionality built into this package that the options are overwhelming.

That’s where this Pandas cheat sheet might come in handy.

It’s a quick guide through the basics of Pandas that you will need to get started on wrangling your data with Python.

As such, you can use it as a handy reference if you are just beginning their data science journey with Pandas or, for those of you who already haven’t started yet, you can just use it as a guide to make it easier to learn about and use it.

Pandas Cheat Sheet for Data Science in Python

The Pandas cheat sheet will guide you through the basics of the Pandas library, going from the data structures to I/O, selection, dropping indices or columns, sorting and ranking, retrieving basic information of the data structures you’re working with to applying functions and data alignment.

In short, everything that you need to kickstart your data science learning with Python!

Python For Data Science Cheat Sheet: Pandas Basics

Use the following import convention:

>>> import pandas as pd

Pandas Data Structures

Series

A one-dimensional labeled array capable of holding any data type

>>> s = pd.Series([3, -5, 7, 4],  index=['a',  'b',  'c',  'd'])
A3
B5
C7
D4

DataFrame

A two-dimensional labeled data structure with columns of potentially different types

>>> data = {'Country': ['Belgium',  'India',  'Brazil'],

'Capital': ['Brussels',  'New Delhi',  'Brasilia'],

'Population': [11190846, 1303171035, 207847528]}

 


 

 CountryCapitalPopulation
1BelgiumBrussels11190846
2IndiaNew Delhi1303171035
3BrazilBrasilia207847528

Please note that the first column 1,2,3 is the index and Country,Capital,Population are the Columns.

Asking For Help

>>> help(pd.Series.loc)

I/O

Read and Write to CSV

>>> pd.read_csv('file.csv', header=None, nrows=5)
>>> df.to_csv('myDataFrame.csv')

Read multiple sheets from the same file

>>> xlsx = pd.ExcelFile('file.xls')
>>> df = pd.read_excel(xlsx,  'Sheet1')

Read and Write to Excel

>>> pd.read_excel('file.xlsx')
>>> df.to_excel('dir/myDataFrame.xlsx',  sheet_name='Sheet1')

Read and Write to SQL Query or Database Table

(read_sql()is a convenience wrapper around read_sql_table() and read_sql_query())

>>> from sqlalchemy import create_engine
>>> engine = create_engine('sqlite:///:memory:')
>>> pd.read_sql(SELECT * FROM my_table;, engine)
>>> pd.read_sql_table('my_table', engine)
>>> pd.read_sql_query(SELECT * FROM my_table;', engine)
>>> df.to_sql('myDf', engine)

Selection

Getting

Get one element

>>> s['b']
-5

Get subset of a DataFrame

>>> df[1:]
Country     Capital   Population
1  India    New Delhi 1303171035
2  Brazil   Brasilia  207847528

Selecting’, Boolean Indexing and Setting

By Position

Select single value by row and and column

>>> df.iloc([0], [0])
'Belgium'
>>> df.iat([0], [0])
'Belgium'

By Label

Select single value by row and column labels

>>> df.loc([0],  ['Country'])
'Belgium'
>>> df.at([0],  ['Country'])
'Belgium'

By Label/Position

Select single row of subset of rows

>>> df.ix[2]
Country      Brazil
Capital    Brasilia
Population  207847528

Select a single column of subset of columns

>>> df.ix[:, 'Capital']
0     Brussels
1    New Delhi
2     Brasilia

Select rows and columns

>>> df.ix[1, 'Capital']
'New Delhi'

Boolean Indexing

Series s where value is not >1

>>> s[~(s > 1)]

s where value is <-1 or >2

>>> s[(s < -1) | (s > 2)]

Use filter to adjust DataFrame

>>> df[df['Population']>1200000000]

Setting

Set index a of Series s to 6

>>> s['a'] = 6

Dropping

Drop values from rows (axis=0)

>>> s.drop(['a',  'c'])

Drop values from columns(axis=1)

>>> df.drop('Country', axis=1) 

Sort and Rank

Sort by labels along an axis

>>> df.sort_index()

Sort by the values along an axis

>>> df.sort_values(by='Country') 

Assign ranks to entries

>>> df.rank()

Retrieving Series/DataFrame Information

Basic Information

(rows, columns)

>>> df.shape

Describe index

>>> df.index

Describe DataFrame columns

>>> df.columns

Info on DataFrame

>>> df.info()

Number of non-NA values

>>> df.count()

Summary

Sum of values

>>> df.sum()

Cumulative sum of values

>>> df.cumsum()

Minimum/maximum values

>>> df.min()/df.max()

Minimum/Maximum index value

>>> df.idxmin()/df.idxmax() 

Summary statistics

>>> df.describe()

Mean of values

>>> df.mean()

Median of values

>>> df.median()

Applying Functions

>>> f = lambda x: x*2

Apply function

>>> df.apply(f)

Apply function element-wise

>>> df.applymap(f)

Internal Data Alignment

NA values are introduced in the indices that don’t overlap:

>>> s3 = pd.Series([7, -2, 3],  index=['a',  'c',  'd'])
>>> s + s3
a     10.0
b     NaN
c     5.0
d     7.0

Arithmetic Operations with Fill Methods

You can also do the internal data alignment yourself with the help of the fill methods:

>>> s.add(s3, fill_value=0)
a    10.0
b    -5.0
c    5.0
d    7.0
>>> s.sub(s3, fill_value=2)
>>> s.div(s3, fill_value=4)
>>> s.mul(s3, fill_value=3)

Original article source at https://www.datacamp.com

#python #pandas #datascience #machinelearning

What is GEEK

Buddha Community

Pandas Cheat Sheet for Data Science in Python

Gaurav Sharma

1618484906

Office.com/setup - Microsoft Office Setup 365 mit Schlüssel office.com/setup

Gaurav Sharma

1618484926

Wir sind hier, um Ihnen bei allen Verfahren zu helfen, wie Sie zum ersten Mal mit HP Printer beginnen können123 hp com

Gaurav Sharma

1620039190

The world’s first, 100% common arrangement intended to traget low center internal heat level meticore supplement /
what is the best supplement for weight loss /
meticore /
meticore supplement reviews

Gaurav Sharma

1620039242

Office.com/setup - Microsoft Office Setup 365 mit Schlüssel office.com/setup /
office setup /
office installieren /

Download & Install office.com/setup – Geben Sie den Produktschlüssel ein, um Office zu aktivieren:office.com/setup /
office setup /
office installieren.

Python for Data Science Cheat Sheet (2021)

The absolute basics for beginners learning Python in 2021

Python has become the most popular computing language to perform data science in 2021. But before you can make astounding deep learning and machine learning models you need to know the basics of Python and the different types of objects first.

Check out the different sections below to learn the various types of objects and their capabilities.

Sections:

_1. _Variables and Data Types

_2. _Lists

_3. _Strings

_4. _NumPy Arrays

_5. _Libraries

#data-scientist #deep-learning #python #data-science #python for data science cheat sheet (2021) #cheat sheet

akshay L

akshay L

1610872689

Data Science With Python Training | Python Data Science Course | Intellipaat

In this Data Science With Python Training video, you will learn everything about data science and python from basic to advance level. This python data science course video will help you learn various python concepts, AI, and lots of projects, hands-on demo, and lastly top trending data science and python interview questions. This is a must-watch video for everyone who wishes o learn data science and python to make a career in it.

#data science with python #python data science course #python data science #data science with python

Uriah  Dietrich

Uriah Dietrich

1618449987

How To Build A Data Science Career In 2021

For this week’s data science career interview, we got in touch with Dr Suman Sanyal, Associate Professor of Computer Science and Engineering at NIIT University. In this interview, Dr Sanyal shares his insights on how universities can contribute to this highly promising sector and what aspirants can do to build a successful data science career.

With industry-linkage, technology and research-driven seamless education, NIIT University has been recognised for addressing the growing demand for data science experts worldwide with its industry-ready courses. The university has recently introduced B.Tech in Data Science course, which aims to deploy data sets models to solve real-world problems. The programme provides industry-academic synergy for the students to establish careers in data science, artificial intelligence and machine learning.

“Students with skills that are aligned to new-age technology will be of huge value. The industry today wants young, ambitious students who have the know-how on how to get things done,” Sanyal said.

#careers # #data science aspirant #data science career #data science career intervie #data science education #data science education marke #data science jobs #niit university data science

Data Science with Python Certification Training in Chennai

Learn Best data science with python Course in Chennai by Industry Experts & Rated as and Best data science with python training in Chennai. Call Us Today!

#data science with python training #data science with python courses #data science with python #data science with python course

Manish Sharma

1602244583

Applied Data Science with Python Certification Training Course -IgmGuru

IgmGuru’s Data Science with Python certification course has been designed after consulting some of the best in the industry and also the faculty who are teaching at some of the best universities. The reason we have done this is because we wanted to embed the topics and techniques which are practiced and are in demand in the industry – conduct them with the help of pedagogy which is followed across universities – kind of applied data science with python. In doing so, we make our learners more industry/job-ready. IgmGuru’s Data Science with Python online training course is the gateway towards your Data Science career.

#applied data science with python #data science with python certification #data science with python online training #data science with python training