Перегляд резюме

Middle Data Scientist

400000 ₽

Росiя

Повна

Віддалена робота

Досвід роботи

2 роки 1 місяць

Останнє місце роботи

Wildberries

Data Scientist

2 роки 1 місяць

Про себе

Data Scientist at Wildberries, working on neural recommendation systems and search ranking in production — candidate generation, retrieval, diversity optimization, A/B testing. Currently exploring LLMs and multimodal models — built RAG pipelines, used VLMs for content understanding, and working on a project that combines vision-language models with recommendation architecture. Academic background in biology and biomedical data analysis (MSc at HSE University). Also teach ML courses at HSE and created recommender systems modules at WB Tech School.

Дані відсутні

Травень 2024 - по т.ч.

(2 роки 1 місяць)

Wildberries

Data Scientist

Mitigated recency bias in nearline recommendations by rebuilding the training dataset, switching to a next-item objective, and introducing a multi-target anti-bias loss for candidate generation. Removed RAM bottlenecks by moving to an iterable dataset pipeline. Result: GMV +13% and diversity +1.7% in production A/B testing. Improved recommendation diversity by replacing quota-based post-processing with category-aware retrieval built on multiple FAISS indices split by category, then tuning the diversity–relevance trade-off through five A/B iterations. Result: +15.7% diversity and +1.5–5% on key engagement metrics, followed by production launch. Improved search ranking by sourcing and integrating a broad set of item-level features into the ARGUS ranking model and running systematic ablation studies to measure their impact on NDCG and ROC. Built a cold-start media recommendation baseline by replacing manual curation with clustered SigLIP embeddings used to drive exploration for new users. Result: ATPU +7.2% in A/B testing and production deployment. Automated h2h A/B experiments by turning a manual multi-team handoff into a reproducible workflow that scores the correct user cohort and publishes daily recommendations by experiment ID. Stack: Python, PyTorch, PySpark, FAISS, Airflow, S3, MLflow, Triton.

Портфоліо

https://drive.google.com/file/d/1H9MLWR7x3K3wwt1fLaqkl7aWUzbKognT/view?usp=sharing

A/B тестування

SQL

Python

Машинне навчання

Глибоке навчання

Нейроннi мережi

MLFlow

Airflow

PyTorch

Розробка рекомендацiй

Дані відсутні

Зайнятість

Повна

Формат роботи

Віддалена робота, Гібрид

Графік роботи

Гнучкий, 5/2

Переїзд

Можливий

Відрядження

Відрядження можливі

Middle Data Scientist

Про себе

Досвід в Affiliate

Досвід роботи
2 роки 1 місяць

Навички

Володіння мовами

Зайнятість

Middle Data Scientist

Про себе

Досвід в Affiliate

Досвід роботи2 роки 1 місяць

Навички

Володіння мовами

Зайнятість

Досвід роботи
2 роки 1 місяць