A 2021 VC backed startup is transforming the patent system using AI, handling every patent document ever created (15+TB of data and 500M+ embeddings). We’re looking for a Senior Software Engineer to own the data and machine learning infrastructure.
What You’ll Do:
- Build and maintain scalable ML infrastructure and data pipelines.
- Handle massive vector databases and embeddings (≥500M vectors).
- Architect distributed systems that process 20M+ API requests daily.
- Optimize cloud infrastructure on Google Cloud for performance and cost.
- Own end-to-end ETL and ML pipelines processing multi-TB datasets.
Tech Stack:
Python | GCP | Postgres / pgvector | Kubernetes | Celery | RunPod | CUDA
Why Join:
- Work with massive, industry-defining datasets.
- Be part of an early-stage startup with high exit potential.
- Join a non-toxic, flexible culture (flexible & empathetic Founders, no weekend crunch).
- Competitive salary $250K + equity, with the opportunity to shape the future of legal tech.
Who You Are:
- 5–12 years of experience in backend, data, or ML infrastructure engineering (with GCP).
- Proven track record scaling and optimizing distributed systems / ML infrastructure.
- Experienced in GPU training, CUDA debugging, and cloud cost optimization.
- CS degree from a top 20 university preferred.
- Enjoys end-to-end ownership in early-stage startups, not just big-tech specialisation.
Apply now if you’ve built ML pipelines at scale and want ownership of infrastructure that powers cutting-edge AI.
Are you looking for remote jobs near your area? At Yulys, thousands of employers are looking for exceptional talent like yours. Find a perfect job now.