Name	Name	Last commit message	Last commit date
parent directory ..
agt_data	agt_data
bokeh	bokeh
data	data
screenshots	screenshots
.gitignore	.gitignore
README.md	README.md
agt_analysis.ipynb	agt_analysis.ipynb
apply.ipynb	apply.ipynb
copy_on_write.ipynb	copy_on_write.ipynb
data_generation.ipynb	data_generation.ipynb
from_long_to_wide_and_back_again.ipynb	from_long_to_wide_and_back_again.ipynb
generate_csv_files.py	generate_csv_files.py
indexing_and_querying.ipynb	indexing_and_querying.ipynb
missing_values.ipynb	missing_values.ipynb
numba_and_pandas.ipynb	numba_and_pandas.ipynb
pandas_datatypes.ipynb	pandas_datatypes.ipynb
pandas_intro.ipynb	pandas_intro.ipynb
patient_data.ipynb	patient_data.ipynb
patients.ipynb	patients.ipynb
pipes.ipynb	pipes.ipynb
pivot_versus_pivot_table.ipynb	pivot_versus_pivot_table.ipynb

Name

Last commit message

Last commit date

data_generation.ipynb

from_long_to_wide_and_back_again.ipynb

generate_csv_files.py

indexing_and_querying.ipynb

missing_values.ipynb

numba_and_pandas.ipynb

pandas_datatypes.ipynb

pivot_versus_pivot_table.ipynb

Pandas

pandas is a library that defines three data structures and algorithms that are useful in the context of data analysis and data science. It represents Series, DataFrame, and Panel, or 1D, 2D, and 3D arrays. DataFrame is especially useful, and defines methods such as pivot_table, and query, and has many facilities to deal with missing data.

For analysis purposes, pandas has some nice plotting features that are easy to use.

What is it?

agt_analysis.ipynb: a notebook illustrating the analysis and visualization of water levels as measured by variouus sensors.
agt_data: three CSV files using in the notebook.
data_generation.ipynb: notebook that generates some simulated gene expression data using numpy and 'pandas`.
pandas_intro.ipynb: illustrates various aspects of using pandas such as importing data, using Series, DataFrame, cleaning and formatting data, dealing with missing data, adding and removing columns, and various algorithms and visualizations.
data: some data sets used in the notebook above.
patients.ipynb: runninng example used in the Python slides.
patient_data.ipynb: extended version of therunninng example used in the Python slides.
pipes.ipynb: consolidating data processing using pipes.
screenshots: screenshots made for the slides.
generate_csv_files.py: script to generate CSV files in different formatg.
copy_on_write.ipynb: Jupyter notebook that illustrates how data is shared between related notebooks and the role Copy-on-Write plays in order to prevent accidental data modifications in more than one dataframe.
apply.ipynb: Jupyter notebook that illustrates the use of the apply method in pandas dataframes for applying functions along rows or columns. It includes a comparison of performance between using apply and vectorized operations.
numba_and_pandas.ipynb: Jupyter notebook that demonstrates how to use Numba to optimize performance of operations on pandas dataframes.
from_long_to_wide_and_back_again.ipynb: Jupyter notebook that illustrates how to reshape data using stack and pivot methods in pandas.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Pandas

What is it?

FilesExpand file tree

pandas

Directory actions

More options

Directory actions

More options

Latest commit

History

pandas

Folders and files

parent directory

README.md

Pandas

What is it?