pdf | markdown

Work Experience

Lead ML Engineer, Domclick | March 2020 – September 2021

As a tech lead of the computer vision team,

  • Designed and led development of a unified recognition pipeline which was employed in 8 services, superseded 4 legacy pipelines, and decreased new model development time by 70%
  • Spearheaded the efforts to move 20+ production models from TensorFlow 1.X to PyTorch, and on the internal MLOps platform, improving both model reproducibility, maintainability, and quality metrics
  • Architected a NER-based data extraction pipeline with 3 new libraries expanding automatic data capture on large texts
  • Managed technical backlog, coordinated team development efforts, and provided technical guidance to 5 team members
  • Ran regular OCR/CV/NLP seminar to fill the experiments backlog and generate ideas for model improvement; authored 25% of the 2000+ training jobs and managed all 60+ model deployments generated by the team in a year
  • Drove development of a core dataset (1m+ labeled image crops, 50+ revisions) and related data access libraries; instructed the annotation team

Technologies: Python, PyTorch, numpy, OpenCV, flairNLP, aiohttp, RabbitMQ, S3, PostgreSQL, git, CI/CD, Docker, Kubernetes, LMDB

Senior ML Engineer, Domclick | May 2018 – March 2020

As a member of the data science team at a real estate marketplace company,

  • Implemented and deployed recognition service which increased a number of automatically processed applications by 30%
  • Built 3 computer vision models for the moderation team (fraud detection models, a room type classification model) to reduce manual work
  • Redesigned document validation service with a custom multi-task network architecture decreasing response time by 60%
  • Extended and overhauled document classification pipeline with 40 new document types while reducing inference speed by 50%
  • Developed and maintained watermark insertion library applied to every property image across the marketplace

Technologies: Python, TensorFlow, Keras, numpy, OpenCV, Sanic, PostgreSQL, ClickHouse, git, CI/CD, Kubernetes, Grafana, InfluxDB

Data Scientist, Flocktory | June 2015 – May 2018

First hire with focus on machine learning for a Marketing-as-a-Service startup working with e-commerce businesses. Built

  • Coupon recommendation system based on clickstream data which increased product revenue by 15%
  • Purchase prediction service that spanned 500+ product categories and 50+ sites and was used for customer segmentation
  • Send time optimization model to schedule marketing campaigns, bringing 5-10% increase in CTR in A/B tests on average
  • Site exit intent detection model which reduced notification volume by 70% without increasing bounce rate

Technologies: Python, SQL, R, AWS, xgboost, scikit-learn, pandas, Vowpal Wabbit, fastText, Vertica, DynamoDB, Kafka, git

Quantitative Risk Analyst, AK BARS Finance | May 2013 – May 2015

  • Created portfolio return distribution modeling tool based on historical time series data empowering value-at-risk analysis
  • Engineered trading terminal plugin for real-time portfolio risk assessment to enable automatic company-wide risk alerts
  • Automated portfolio stress testing simulation to save 4 man-days each month

Technologies: R, tseries, copula, GARCH, Lua, Excel VBA

Education

Lomonosov Moscow State University | September 2007 – July 2012

Specialist degree in Mathematics and System Programming (M.Sc. in Computer Science)