We’re building a cutting-edge system that combines LLM agents, advanced data analytics, and cybersecurity research. Backed by leading global VCs and led by seasoned founders with multiple successful exits, we’re assembling a top-tier engineering team to shape the future of data-driven security technology.
We’re looking for a skilled Software Engineer to join our Data Platform team. In this role, you’ll be responsible for designing and implementing a highly scalable data infrastructure capable of processing and analyzing massive volumes of connections and events daily. This foundation will support real-time and batch analytics, ML workflows, and LLM-based models.
Responsibilities
- Design and develop data lakes, data warehouses, and scalable data pipelines
- Build robust ingestion and transformation processes for large, complex datasets
- Manage and automate workflows using modern orchestration tools
- Optimize performance of distributed databases, including indexing, partitioning, and query tuning
- Ensure the platform can scale while maintaining low-latency performance
- Work closely with data scientists and cybersecurity researchers to support ML and LLM features
- Enable real-time servicing of ML models and retrieval-augmented generation (RAG) pipelines
- Drive high-quality, well-tested code with strong CI/CD practices
- Stay ahead of industry trends in data engineering, ML infrastructure, and cloud services
Requirements
- Proven experience (4+ years) in designing and building large-scale data platforms using cloud technologies (AWS/GCP/Azure)
- Strong backend development background in data-intensive environments
- Solid knowledge of data modeling and scalable architecture for both batch and real-time systems
- Hands-on experience with streaming technologies like Kafka, Flink, or Spark Streaming
- Deep understanding of modern databases (SQL, NoSQL, Graph) and performance optimization
- Familiarity with data security, encryption, and governance best practices
- Experience with microservices, Kubernetes, and Terraform – an advantage
- Experience with orchestration tools such as Dagster or Apache Airflow – an advantage