Statistical Computing in Python and R
Published:
Notebooks and reference for most routine tasks in data-management and econometrics in R and Python [typically written in jupyter notebooks / forked and exported with to HTML H1-H4 headers for easy reference using the html-toc extension].
R
- Data munging
- DT ~ dplyr
- data.table
- ggplot notes (based on Kieran Healy’s draft book)
- spatial (based on the in-progress Pebesma-Bivand Book)
- MHE Replication / Applied Econometrics Notes -
- Notes from Classes - temporarily out of commission
- Old
- Sqlite / Google Bigquery with R
- Tidyverse Non standard evaluation [outdated; use rlang instead]
Python
- Numpy
- Python data science
- QuantEcon Data science link
- Tom Augspurger’s Modern Pandas
- basics notebook;
- gist reference
- Econometrics
- Maths etc
- Spatial
- Overview / Teaching Notebook
- Intermediate
- Vector Data + Nunn 2009 replication
- Conda environment specifications for working GIS stack gist
- Dan Arribas’ Geographical Data Science notes
- GDS Book-in-progress
Julia
General
Translation notes
- Data.table ~ Pandas
- Dplyr ~ Pandas
- Stop using Stata [open science and proprietary software are fundamentally antithetical]