This vacancy is closed.
25.06.2025
Vacancy: Senior Data Engineer (AI and ML frameworks)
Company: | Sigma Software |
---|---|
Vacancy domain: | Healthcare / MedTech / LifeScience, Machine Learning / Big Data |
Work experience: | Any |
Specialist level: | Senior |
English level: | Any |
Test task: | Yes |
Employment: | Full-time |
Relocation: | No relocation |
Workplace: | Office, Remote |
Location: | Ukraine |
Skills
- Kafka
- JSON
- Avro
- Python
- Apache Flink
- Kubernetes
- Helm
- GKE
- GCP
- Confluent Kafka
- SQL
- RDBMS
- NoSQL
- Neo4j
Job description
We are looking for a talented Senior Data Engineer with a strong background in developing or contributing to applications based on microservices using a Kappa architecture. The project aims to unify data sourced from different EHR systems in the healthcare domain, using the FHIR data format.
Customer
Our client is a leading analytics company operating at the intersection of technology, artificial intelligence, and big data. They support manufacturers and retailers in the fast-moving consumer goods sector, helping them better understand market dynamics, uncover consumer behavior insights, and make data-driven business decisions.
Project
The project aims to unify data sourced from various EHR systems in the healthcare domain using the FHIR data format. The company’s proprietary technology platform combines high-quality data, deep industry expertise, and advanced predictive algorithms built over decades of experience in the field.
Requirements
- Deep understanding of patterns and software development practices for event-driven architectures
- Hands-on experience with stateful stream data processing solutions (Kafka or similar streaming platforms)
- Strong knowledge of data serialization/deserialization using various data formats (at minimum JSON and Avro), and integration with schema registries
- Proven Python software development expertise, with experience in data processing and integration (most of the software is written in Python)
- Practical experience building end-to-end solutions with Apache Flink or a similar platform
- Experience with containerization and orchestration using Kubernetes (K8s) and Helm, especially on Google Kubernetes Engine (GKE)
- Familiarity with Google Cloud Platform (GCP) or a similar cloud platform
- Hands-on experience implementing data quality solutions for schema-on-read or schema-less data
- Hands-on experience integrating with Apache Kafka, particularly the Confluent Platform
- Familiarity with AI and ML frameworks
- Proficiency in SQL and experience with both relational and NoSQL databases
- Experience with graph databases like Neo4j or RDF-based systems
- Experience in the healthcare domain and familiarity with healthcare standards such as FHIR and HL7 for data interoperability
Would be a plus:
- Experience with web data scraping
Personal Profile
- Strong problem-solving skills, with the ability to design innovative solutions for complex data integration and processing challenges
- Excellent communication skills, with the ability to articulate complex technical concepts and work effectively with various stakeholders
- Commitment to improving healthcare through data-driven solutions and technology
- Drive to stay abreast of the latest technologies and industry trends while continually improving skills and knowledge
- Ability to work in a collaborative environment, valuing diverse perspectives and contributing to a positive team culture
Responsibilities
- Data Standardization and Transformation:
  - Convert diverse data structures from various EHR systems into a unified format based on FHIR standards
  - Map and normalize incoming data to the FHIR data model, ensuring consistency and completeness
- Kafka Integration:
  - Consume and process events from the Kafka stream produced by the Data Writer Module
  - Deserialize and validate incoming data to ensure adherence to required standards
- Data Segmentation:
  - Separate data streams for warehousing and AI model training, applying specific preprocessing steps for each purpose
  - Prepare and validate data for storage and machine learning model training
- Error Handling and Logging:
  - Implement robust error handling mechanisms to track and resolve data mapping issues
  - Maintain detailed logs for auditing and troubleshooting purposes
- Data Ingestion and Processing:
  - Use LLMs to extract structured data from EHRs, research articles, and clinical notes
  - Ensure semantic consistency and interoperability during data ingestion
- Knowledge Graph Construction:
  - Integrate extracted data into a knowledge graph, representing entities and relationships for semantic data integration
  - Implement contextual understanding and querying of complex relationships within the knowledge graph (KG)
- Advanced Predictive Modeling:
  - Leverage KGs and LLMs to enhance data interoperability and predictive analytics
  - Develop frameworks for contextualized insights and personalized medicine recommendations
- Feedback Loop:
  - Continuously update the knowledge graph with new data using LLMs, ensuring up-to-date and relevant insights
- Work Closely with Cross-Functional Teams:
  - Collaborate with data scientists, AI specialists, and software engineers to design and implement data processing solutions
  - Communicate effectively with stakeholders to align on goals and deliverables
- Contribute to Engineering Culture:
  - Foster a culture of innovation, collaboration, and continuous improvement within the engineering team
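For illustration only, the first three responsibilities above (deserialize and validate an event, map it to FHIR, then split it for warehousing and model training) might be sketched in Python roughly as follows. The input field names (`patient_id`, `dob`, `sex`, and so on), the source date format, and the helper names are hypothetical and not taken from the project; only the output keys follow the FHIR Patient resource. The Kafka consumer itself is left out, so the function simply receives a raw JSON payload.

```python
import json
from datetime import datetime

# Hypothetical source codes; real EHR systems encode gender differently.
GENDER_MAP = {"M": "male", "F": "female", "O": "other"}


def to_fhir_patient(raw: dict) -> dict:
    """Map a hypothetical flat EHR record to a minimal FHIR Patient resource."""
    missing = [k for k in ("patient_id", "last_name", "dob") if not raw.get(k)]
    if missing:
        raise ValueError(f"missing required fields: {missing}")
    # Assumed source format MM/DD/YYYY; FHIR dates are written YYYY-MM-DD.
    birth_date = datetime.strptime(raw["dob"], "%m/%d/%Y").date().isoformat()
    name = {"family": raw["last_name"]}
    if raw.get("first_name"):
        name["given"] = [raw["first_name"]]
    return {
        "resourceType": "Patient",
        "id": str(raw["patient_id"]),
        "name": [name],
        "gender": GENDER_MAP.get(raw.get("sex"), "unknown"),
        "birthDate": birth_date,
    }


def split_streams(patient: dict) -> tuple[dict, dict]:
    """Return (warehouse_record, training_record) derived from one Patient."""
    warehouse = dict(patient)
    # Illustrative preprocessing only: drop direct identifiers and
    # coarsen the birth date to a year for the training stream.
    training = {
        "gender": patient["gender"],
        "birthYear": int(patient["birthDate"][:4]),
    }
    return warehouse, training


def process_event(payload: bytes) -> tuple[dict, dict]:
    """Deserialize a JSON payload, map it to FHIR, and split it in two."""
    raw = json.loads(payload)
    return split_streams(to_fhir_patient(raw))


event = (
    b'{"patient_id": 42, "first_name": "Ada", "last_name": "Lovelace",'
    b' "dob": "07/04/1980", "sex": "F"}'
)
warehouse, training = process_event(event)
```

Here `warehouse` holds the full Patient resource and `training` holds only the reduced fields. A record that lacks a required field raises `ValueError`, which a real pipeline would presumably log and route to an error stream rather than let propagate.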
Interview stages
- Apply
- Recruiter prescreen
- Interview
- Decision
- Offer
Employee benefits
- Work-life balance
- Flexible working hours
- Medical insurance
- Educational programs and courses
- Legal support
About Sigma Software
Sigma Software provides high-quality software development solutions and IT consulting to more than 170 clients worldwide. The company works with clients in finance and banking, automotive, media and advertising, telecommunications, cybersecurity, gambling, aviation, real estate, energy, and healthcare.
Company website: sigma.software
Founded: 2002
Number of employees: 1001-5000
Company type: Outsource