teaching:progappchim:pandas

Différences

Ci-dessous, les différences entre deux révisions de la page.

Lien vers cette vue comparative

Les deux révisions précédentes Révision précédente
Prochaine révision
Révision précédente
Prochaine révisionLes deux révisions suivantes
teaching:progappchim:pandas [2019/11/22 11:49] villersdteaching:progappchim:pandas [2021/09/14 04:10] villersd
Ligne 12: Ligne 12:
   * [[http://pandas.pydata.org/pandas-docs/version/0.20/cookbook.html|cookbook]]   * [[http://pandas.pydata.org/pandas-docs/version/0.20/cookbook.html|cookbook]]
   * [[http://pandas.pydata.org/pandas-docs/stable/user_guide/visualization.html|Visualisation]]   * [[http://pandas.pydata.org/pandas-docs/stable/user_guide/visualization.html|Visualisation]]
 +  * [[https://www.datacamp.com/community/blog/python-pandas-cheat-sheet|Pandas Cheat Sheet for Data Science in Python]]
  
 ===== Applications, exemples ===== ===== Applications, exemples =====
Ligne 55: Ligne 56:
   * autres représentations   * autres représentations
   * différentiation suivant le genre, l'âge   * différentiation suivant le genre, l'âge
 +  * utiliser d'autres fonctions, comme nsmallest() et nlargest(), value_counts() (se baser sur la documentation officielle)
   * ...   * ...
  
Ligne 170: Ligne 172:
 df.Weight.plot.hist() df.Weight.plot.hist()
 </code> </code>
 +
 +===== Interface utilisateur graphique =====
 +  * [[https://github.com/adamerose/pandasgui|PandasGUI]]
 +    * [[https://towardsdatascience.com/pandasgui-analyzing-pandas-dataframes-with-a-graphical-user-interface-36f5c1357b1d|PandasGUI: Analyzing Pandas dataframes with a Graphical User Interface - Accessing Pandas Dataframes with a simple click of the mouse]] Parul Pandey, Medium, 24/10/2020
 +  * [[https://github.com/tkrabel/bamboolib|bamboolib]] (closed source - non libre)
  
 ===== Références ===== ===== Références =====
   * [[https://www.tutorialspoint.com/python_pandas/index.htm]]   * [[https://www.tutorialspoint.com/python_pandas/index.htm]]
 +  * [[https://towardsdatascience.com/top-4-repositories-on-github-to-learn-pandas-1008cb769f77|Top 4 Repositories on GitHub to Learn Pandas - Some of the most popular repositories to brush up on Pandas for beginners and experts alike]] Byron Dolon, Medium, Jul 21, 2020
 +    * [[https://github.com/guipsamora/pandas_exercises|GitHub - guipsamora/pandas_exercises: Practice your pandas skills!]]
 +    * [[https://github.com/justmarkham/pandas-videos|GitHub - justmarkham/pandas-videos: Jupyter notebook and datasets from the pandas Q&A video series]]
 +    * [[https://github.com/ajcr/100-pandas-puzzles|GitHub - ajcr/100-pandas-puzzles: 100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)]]
 +    * [[https://github.com/justmarkham/pycon-2019-tutorial|GitHub - justmarkham/pycon-2019-tutorial: Data Science Best Practices with pandas]]
   * [[https://medium.com/@devopslearning/introduction-to-pandas-for-data-analysis-c14bb9b1c21b]] (limité)   * [[https://medium.com/@devopslearning/introduction-to-pandas-for-data-analysis-c14bb9b1c21b]] (limité)
   * [[http://www.firstpythonnotebook.org/index.html|First Python Notebook. A step-by-step guide to analyzing data with Python and the Jupyter Notebook]] (The course will teach you how to use pandas to read, filter, join, group, aggregate and rank structured data. You will also learn how to record, remix and republish your analysis using the Jupyter Notebook) → commencer au chapitre 3 "Import pandas into a Jupyter Notebook"   * [[http://www.firstpythonnotebook.org/index.html|First Python Notebook. A step-by-step guide to analyzing data with Python and the Jupyter Notebook]] (The course will teach you how to use pandas to read, filter, join, group, aggregate and rank structured data. You will also learn how to record, remix and republish your analysis using the Jupyter Notebook) → commencer au chapitre 3 "Import pandas into a Jupyter Notebook"
Ligne 187: Ligne 199:
   * [[http://cmdlinetips.com/2018/03/how-to-make-boxplots-in-python-with-pandas-and-seaborn/|How to Make Boxplots in Python with Pandas and Seaborn?]] (and Gapminder dataset)   * [[http://cmdlinetips.com/2018/03/how-to-make-boxplots-in-python-with-pandas-and-seaborn/|How to Make Boxplots in Python with Pandas and Seaborn?]] (and Gapminder dataset)
   * [[https://www.geeksforgeeks.org/box-plot-visualization-with-pandas-and-seaborn/|Box plot visualization with Pandas and Seaborn]]   * [[https://www.geeksforgeeks.org/box-plot-visualization-with-pandas-and-seaborn/|Box plot visualization with Pandas and Seaborn]]
 +  * [[https://towardsdatascience.com/complete-guide-to-data-visualization-with-python-2dd74df12b5e|Complete Guide to Data Visualization with Python]] (avec différentes librairies : matplotlib, Seaborn, bokeh, Altair, Folium avec des cartes,...)
 +  * [[https://towardsdatascience.com/introducing-bamboolib-a-gui-for-pandas-4f6c091089e3|Introducing Bamboolib — a GUI for Pandas]] (utilisation gratuite ou payante via une activation nécessaire dans jupyter)
 +    * [[https://bamboolib.8080labs.com/]]
 +    * [[https://github.com/tkrabel/bamboolib]]
 +  * [[https://towardsdatascience.com/violin-plot-its-time-to-ditch-the-box-plots-785629b0ff3a|Violin Plot — It’s Time to Ditch the Box Plots]]
 +  * [[https://towardsdatascience.com/reshape-pandas-dataframe-with-melt-in-python-tutorial-and-visualization-29ec1450bb02|Reshape pandas dataframe with melt in Python — tutorial and visualization]]
 +    * [[https://pandas.pydata.org/docs/reference/api/pandas.melt.html|pandas.melt]] (documentation)
 +  * groupby :
 +    * [[https://towardsdatascience.com/learn-how-to-master-groupby-function-in-python-now-4620dd463224|Learn how to master groupby function in Python now | by WY Fok | Towards Data Science]]
 +    * [[https://towardsdatascience.com/less-known-pandas-groupby-applications-in-python-72fb304a1973|Less known Pandas groupby applications in Python | by WY Fok | Aug, 2020 | Towards Data Science]]
 +  * remplacer Excel (ou calc de libreoffice)
 +    * [[https://towardsdatascience.com/how-to-ditch-excel-a-primer-to-python-c4a9c192e993|Ditch Excel! — A Primer to Python - Pandas one-liners for popular excel stuff]], Medium 01/09/2020
 +  * [[https://towardsdatascience.com/how-not-to-write-pandas-code-2cbda8b3816c|How NOT to write pandas code]]
 +  * [[https://towardsdatascience.com/40-examples-to-master-pandas-c69d058f434e|40 Examples to Master Pandas - A comprehensive practical guide]]
 +  * [[https://towardsdatascience.com/pandas-fundamentals-for-beginners-that-will-help-you-code-like-a-pro-2ff5e34fee09|Pandas fundamentals every data scientist needs to know - To boost your performance and code like a pro]] Mısra Turp, Medium, 07/01/2021
 +  * [[https://python.plainenglish.io/add-some-style-to-your-pandas-dataframe-ae3ed4ac7804|Add Some Style to your Pandas DataFrame - Putting Some Pizzaz into your Data]] Curt Beck; Medium, Oct 11, 2020
 +  * [[https://towardsdatascience.com/spreadsheets-to-python-its-time-to-make-the-switch-ef49cf9463d5|Spreadsheets to Python: it's time]]  Clive Siviour, Towards Data Science, Medium 03/09/2021
 +
  
 ==== Datasets ==== ==== Datasets ====
   * [[https://archive.ics.uci.edu/ml/datasets/Adult|Adult Data Set]] Predict whether income exceeds $50K/yr based on census data. Also known as "Census Income" dataset   * [[https://archive.ics.uci.edu/ml/datasets/Adult|Adult Data Set]] Predict whether income exceeds $50K/yr based on census data. Also known as "Census Income" dataset
   * [[https://archive.ics.uci.edu/ml/datasets.php]]   * [[https://archive.ics.uci.edu/ml/datasets.php]]
 +  * [[https://towardsdatascience.com/7-examples-to-master-line-plots-with-python-seaborn-42d8aaa383a9|7 Examples to Master Line Plots With Python Seaborn - Practical data visualization guide]] Soner Yıldırım, 12/09/2021 → Seaborn + Pandas + librairie [[https://pandas-datareader.readthedocs.io/en/latest/|pandas datareader]] donnant accès à de nombreuses données (bourses, OECD, Eurostat,...)
 +
  
 ==== Exemples divers ==== ==== Exemples divers ====
Ligne 198: Ligne 230:
   * régressions linéaires :   * régressions linéaires :
     * [[http://www.xavierdupre.fr/app/ensae_teaching_cs/helpsphinx/notebooks/td2a_eco_regressions_lineaires.html]]     * [[http://www.xavierdupre.fr/app/ensae_teaching_cs/helpsphinx/notebooks/td2a_eco_regressions_lineaires.html]]
 +  * [[https://towardsdatascience.com/effectively-visualize-data-across-time-to-tell-better-stories-2a2c276e031e|Effectively visualize data across time to tell better stories]] (Pandas & Plotly)
 +  * [[https://medium.com/analytics-vidhya/statistical-analysis-in-python-using-pandas-27c6a4209de2|Statistical Analysis in Python using Pandas]] Tanvi Penumudy, Medium, dec 31, 2020
 +  * [[https://towardsdatascience.com/an-ultimate-cheat-sheet-for-data-visualization-in-pandas-f7bc239c9937|An Ultimate Cheat Sheet for Data Visualization in Pandas - All the Basic Types of Visualization That Is Available in Pandas and Some Advanced Visualization That Are Extremely Useful and Time Savers]] Rashida Nasrin Sucky, Medium, 15/02/2021 (dataset, jupyter)
 +
 +
  
  • teaching/progappchim/pandas.txt
  • Dernière modification : 2022/11/15 10:08
  • de villersd