In A Nutshell
Contribute to the Eviction Lab at Princeton University’s mission to create data and research products to help researchers, policymakers, and community members understand the eviction crisis.
Responsibilities
- Improving existing code base: reviewing code base; designing tests to assess data quality; designing tests to assess speed and identify bottlenecks; rewriting code to optimize speed and quality, and remove extraneous operations.
- Developing a data pipeline for new datasets: preprocessing data to conform to uniform data standards; identifying missing data and making appropriate imputations; running standardized data through data construction pipeline; identifying and fixing bugs; assessing resulting data products for accuracy and completeness.
- Leading the development of new data features and products: constructing new measures and assessing them for accuracy; incorporating new types of data and making measures based on them.
Skillset
- Bachelor’s degree or equivalent.
- 3+ years of relevant experience.
- Extensive experience writing data pipelines written in Python, specifically Pandas and GeoPandas.
- Extensive experience working with large datasets.
- Familiarity with mapping and geographic data processing.
- Familiarity with Git.
- Demonstrated ability to work independently.
- Knowledge of regular expressions (regex).