Code


This is where most of my typing goes to.

Open source Python projects

pdpipe - Easy pipelines for pandas DataFrames. [website] [github]

pulearn - Positive-unlabeled learning with Python. [website] [documentation] [github]

skift - scikit-learn wrappers for Python fastText.

cachier - Persistent, stale-free memoization decorators for Python.

stationarizer - Smart, automatic detection and stationarization of non-stationary time series data.

birch - Simple hierarchical configuration for Python packages.

s3bp - Read and write Python objects to S3, caching them on your hard drive to avoid unnecessary IO.

holcrawl - A crawler for building Hollywood movies datsets.

morejson - A drop-in replacement for Python’s json module that handles additional built-in and standard library Python types.