Code
This is where most of my typing goes.
Open source Python projects
pdpipe - Easy pipelines for pandas DataFrames. [website] [github]
pulearn - Positive-unlabeled learning with Python. [website] [documentation] [github]
skift - scikit-learn wrappers for Python fastText.
cachier - Persistent, stale-free memoization decorators for Python.
stationarizer - Smart, automatic detection and stationarization of non-stationary time series data.
birch - Simple hierarchical configuration for Python packages.
s3bp - Read and write Python objects to S3, caching them on your hard drive to avoid unnecessary IO.
holcrawl - A crawler for building Hollywood movie datasets.
morejson - A drop-in replacement for Python’s json module that handles additional built-in and standard library Python types.
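
To give a feel for what "persistent, stale-free memoization" (the idea behind cachier, listed above) means, here is a minimal standard-library sketch. It is not cachier's actual API or implementation: the decorator name `persistent_memoize` and its `cache_dir` and `stale_after` parameters are hypothetical, chosen only for illustration, and it assumes the function's arguments and return value can be pickled.

```python
import datetime
import functools
import hashlib
import os
import pickle
import tempfile


def persistent_memoize(cache_dir, stale_after):
    """Hypothetical decorator: cache results on disk, recompute when stale."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            # Key each cache entry on the function name and its pickled arguments.
            raw_key = pickle.dumps((func.__name__, args, sorted(kwargs.items())))
            file_name = hashlib.sha256(raw_key).hexdigest() + ".pkl"
            path = os.path.join(cache_dir, file_name)
            if os.path.exists(path):
                with open(path, "rb") as f:
                    saved_at, value = pickle.load(f)
                # Reuse the stored value only if it is younger than stale_after.
                if datetime.datetime.now() - saved_at < stale_after:
                    return value
            # Missing or stale entry: compute and store with a timestamp.
            value = func(*args, **kwargs)
            with open(path, "wb") as f:
                pickle.dump((datetime.datetime.now(), value), f)
            return value
        return wrapper
    return decorator


calls = []  # records real executions so the caching is observable


@persistent_memoize(tempfile.mkdtemp(), datetime.timedelta(days=1))
def slow_square(x):
    calls.append(x)
    return x * x
```

Calling `slow_square(4)` twice runs the function body only once; the second call is served from the pickle file on disk, and it would be recomputed once the entry is older than a day.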