Deine Aufgaben
At Hypatos, we build vertical AI Agents that automate document-heavy, back-office workflows across enterprise functions like finance, compliance, procurement, and customer service. To scale our impact, we are looking for a Senior Data Engineer who will architect and build robust, scalable data infrastructure powering our AI-driven automation systems.
This role is deeply technical and cross-functional: You’ll work closely with AI Engineers, Product, and Delivery teams to design distributed data pipelines, optimize real-time data flows, and ensure our systems are reliable, performant, and secure at scale.
Key Responsibilities
This role is deeply technical and cross-functional: You’ll work closely with AI Engineers, Product, and Delivery teams to design distributed data pipelines, optimize real-time data flows, and ensure our systems are reliable, performant, and secure at scale.
Key Responsibilities
- Design and implement distributed data pipelines using Spark (PySpark), Kafka, and Kubernetes.
- Build and maintain scalable data infrastructure for real-time and batch processing.
- Develop and optimize data models and storage solutions using ClickHouse and vector databases.
- Collaborate with AI engineers to support Retrieval-Augmented Generation (RAG) pipelines and other LLM-powered workflows.
- Ensure data reliability, performance, and security across all environments.
- Contribute to CI/CD workflows, testing frameworks, and monitoring solutions for data systems.
- Translate business requirements into scalable data solutions in collaboration with Product and Delivery teams.
- Mentor junior engineers and promote best practices in data engineering and distributed systems.