In A Nutshell
Lead the design, development, and deployment of Wikidata’s query platform architecture.
Responsibilities
- Stability, performance, and scalability of the Wikidata Query Service (WDQS) architecture and data pipeline.
- Articulating a vision for Wikidata Platform’s query infrastructure that supports continued growth and future sustainability.
- Developing new query methods, APIs, algorithms, and indexing strategies to optimize graph search capabilities for priority use cases.
- Collaborating with the Staff Product Manager, Engineering Manager, and other cross-functional colleagues to design system requirements and ship iterative improvements to meet user needs.
- Maintaining an understanding of current developments in structured knowledge representation technologies in order to propose innovative solutions.
- Persevering through setbacks to ensure team goals are met, or communicating when a pivot may be necessary.
- Developing an understanding of our movement and how it drives our work.
- Developing best practices for interacting with platform query services.
- Performing data analysis to uncover insights and patterns.
Skillset
- 8+ years of experience building and scaling API-driven data platform products with technical userbase.
- 4+ years of experience in data engineering, specifically with production deployments at scale.
- Deep understanding of database and knowledge graph representation technologies and standards.
- Proficiency in Java, C++, or other programming languages for database interactions.
- Ability to set up, scale, and investigate systems is more important than expertise in a particular language.
- Experience navigating issues associated with privacy-sensitive data and familiarity with security best practices in implementing database query services.
- Past success in breaking down ambiguous projects into clear tasks.
- Ability to work with multiple stakeholder teams to deliver results through lateral influence and collaboration.
- Knowledge of highly scalable data processing frameworks (Spark, Kafka, Flink, etc.)