Maxim Keremet

Maxim Keremet

Data Engineer · MLOps

🇫🇮 Helsinki, Finland  |  Blue card holder

Skills


Programming & data processing Python, Java (Spring Boot), SQL, Spark (PySpark, Scala Spark, MLlib), Bash
Pipelines & orchestration Airflow, Dagster, Mage
Streaming & real-time Kafka, AWS Kinesis, Flink, Druid
Storage & warehousing Snowflake, ClickHouse, PostgreSQL, Hadoop (HDFS, Hive), Iceberg, Alembic, S3
Visualization & BI Superset/Preset, Dash, Streamlit, Grafana
Cloud platforms AWS, GCP, Yandex.Cloud
DevOps & infra Docker, Kubernetes, Helm, Terraform
Observability & reliability Prometheus, Datadog, Sentry
MLOps & experimentation MLflow, Feast

Experience


Next Games, a Netflix Games studio Helsinki, Finland
Senior Data Engineer Oct 2024 – Current
  • Being a part of DSE (Data Science and Engineering) team, contributing to data engineering practices and backend services.
  • Maintaining and evolving core services and infrastructure for game studio streaming analytics.
  • Building analytics-oriented batch pipelines for game analysts and infrastructure-oriented pipelines for monitoring game and services health, AB testing, etc.
  • Contributing to and supporting analytics tools for internal stakeholders.
  • Refactoring services from Python to Java Flink services to align with Netflix paved roads and infrastructure standards.
Wolt Helsinki, Finland
Data Engineer / Software Engineer Aug 2022 – Sep 2024
  • Building infrastructure for a batch processing tool based on Airflow.
  • Maintaining and enhancing an existing streaming tool.
  • Collaborating on streaming platform team initiatives with Scala and Iceberg table format.
  • Managing and contributing to infrastructure integrations between the batch platform and Snowflake.
  • Performing data operations and ad-hoc tasks such as creating data models, Snowflake tables, Kafka connectors, etc.
  • Improving observability for internal tools using DataDog by building dashboards, monitors, and logging collections.
  • Developing SRE processes for data services, including maintaining documentation, troubleshooting incidents, and conducting postmortems.
  • Developing an internal declarative data workflow definition tool for data professionals (50+ internal users).
  • Contributing to all aspects of the application: internal logic, CI/CD, infrastructure, metrics collection, monitoring and alerting, Snowflake integrations, and user documentation.
  • Designing and leading the migration process from a legacy solution to a new workflow.
X5 Retail Group (largest Russia food retailer) Moscow, Russia
Lead Data Engineer / MLOps Feb 2021 – May 2022
  • Built ML features collection ETL pipelines (Feature Store).
  • Built a Python package serving as API for analysts, data scientists, and production for easy access to ML features.
  • Crafted models retraining processes — scheduled retraining in production and on-demand for different use cases.
  • Built fully automated CI/CD pipelines on GitlabCI for containerized applications using Docker, Helm and Rancher.
  • Designed overall pipeline architecture for ML models in production and scaled batch model inference for over 50M users.
  • Deployed tools for the team: Airflow, Superset, Datahub.
  • Built a data monitoring pipeline (PySpark + Airflow + Superset + Postgres) gathering metrics for all tables with ML features and model results.
VK.com (social networking service) St. Petersburg, Russia
Data Engineer / Product Analyst Feb 2020 – Nov 2020
  • Built pipeline for collecting product metrics for all analytical use cases.
  • Performed analytical ad-hoc analysis tasks (EDA, product and tech performance dashboards, coronavirus reporting, etc.).
  • Improved AB testing framework methodology (researching CUPED and different statistical test approaches), developing custom frontend and backend on Dash and ClickHouse.
X5 Retail Group (largest Russia food retailer) Moscow, Russia
Data Engineer Nov 2018 – Feb 2020
  • Built assortment matrix optimizer as part of a web-based platform for category managers.
  • Full stack development from prototype to production: business logic, testing, documentation, logging, alerting, debugging, API integration with frontend services and Kafka, deploying in Kubernetes.
Mail.ru Group Moscow, Russia
Data Analyst Jul 2017 – Nov 2018
  • Performed analytical ad-hoc analysis tasks.
  • Performed typical DS workflow: data cleaning → exploring data → building machine learning models → performing ML evaluations.

Extracurricular Activity


Yandex.Practicum Remote
MLOps Course Tech Lead / Author Feb 2025 – Jun 2025
  • Constructed curriculum on the module/lesson level, educational results, practical assignments and other program-associated docs.
  • Hiring and managing a group of 5+ authors to produce educational content, assignments and infrastructure for the program.
  • Hiring support team members — course mentors and assignment reviewers.
  • Acting as an author and contributing to the program.
  • Collaborating with devops team to provision, set up course infrastructure and optimizing cloud costs per student.
Central University by Tinkoff Remote
Course Lead Jan 2024 – Current
  • Designed a bachelor's course in Python with profile tracks for Data analyst, ML engineer and SWE.
  • Leading a group of authors, contributing with content myself, designing student learning experience, and building the production side of the course.
  • Contributing to data engineering course.
Practicum by Yandex Remote
Course Mentor and Contributor Jul 2019 – Current
  • Contributed with a 30-hour course module on Hadoop, PySpark and ML in Spark.
  • Mentored over 1,000 students on their code.
Open ML course Remote
Contributor and TA Sep 2018 – Dec 2019
  • Preparing lessons and materials for students of mlcourse.ai.
  • Giving guidance on assignments, answering questions and making tutorials.

Education


University of Gothenburg
M.S. in Medicine
Gothenburg, Sweden
Sep 2012 – Aug 2014
Jönköping University
Bachelor of Business Administration (B.BA.)
Jönköping, Sweden
Sep 2010 – Aug 2011
Plekhanov Russian University of Economics
B.S. in Economics
Moscow, Russia
Sep 2008 – Aug 2012