GitHub 🐱‍💻 | Google Scholar 📚 | StackOverflow 🖥️ | LeetCode 🧩 | LinkedIn 🔗

ML Software Engineer 🤖

Passionate ML/DL/CV/NLP Engineer with 3+ years of industry and 2+ years of academic experience delivering impactful solutions across various industries. Proficient in supervised, self-supervised, and transfer learning, with in-depth experience in OCR, object detection, segmentation, tracking, video recognition, and action classification.

Skilled in developing and deploying machine learning models on AWS and GCP, building and optimizing pipelines, containerization, and collaborating with cross-functional teams to drive business growth.

Skills Summary 🛠️

  • Programming Languages: Python, C/C++, Java.
  • Database Management: MySQL, PostgreSQL, PySpark.
  • ML: NumPy, scikit-learn, PyTorch, PyTorch Lightning, TensorFlow, Keras, Hugging Face Transformers.
  • MLOps: Docker Compose, Dockerization, Kubeflow, MLflow, Flask, FastAPI, gRPC, TorchServe, Triton, TensorRT.
  • Development Tools: Git/GitHub, Docker, CI/CD.
  • Cloud: AWS EC2, GCP.
  • Main Competencies: Object Detection, Object Tracking, OCR, Clustering, Re-Identification, Medical Imaging, Image Restoration & Enhancement, DeepFakes, Generative Models, Vision-Language Models, Large Language Models, Natural Language Processing, Building End-to-End Pipelines, Deployment Pipelines, GCP Deployment.

Work Experience 💼

AI Research Engineer 🧠

Aria Studios Co. Ltd | March 2024 - Present

  • LG Ground 220: Developed the AI backend for MusicStudio and DJingStudio, featuring lyrics generation from user input, cover image creation, and music generation using OpenAI, StabilityAI, and MixAudio APIs.
  • GPT Fine-tuning: Fine-tuned the GPT-3.5-turbo model on a virtual character's conversation data to build a custom API for the virtual assistant. Augmented the training data by paraphrasing conversations with the OpenAI API.
  • LLMs Deployment: Deployed a lightweight Phi-3 model for emotion detection from text on GCP using FastAPI and prepared a deployment container. The main objective was to detect users' emotions during interaction with a virtual character so that it can respond accordingly.
  • VLMs Deployment: Developed an “eye” for a virtual character to see and understand its surroundings, enabling it to interact with users. Deployed the Phi-3-Vision model on GCP for this purpose.
  • Face Parsing: Employed a face parsing model to segment the face and improve face swapping performance. Enhanced model performance by modifying the feature extractor (backbone) and the training strategy. The implementation can be found here.
  • Image Enhancement, Face Restoration & DeepFake: Worked on image enhancement and face restoration to improve DeepFakes. Created a DeepFake video for KBS election coverage, which can be seen here.

ML Engineer 🖥️

Pyler Co. Ltd | July 2022 - September 2023

  • Video-based Visual Content Moderation: Built a Video Moderation Pipeline to flag inappropriate video content using video recognition models, achieving over a 10% improvement in model accuracy.
  • Detection-based Visual Content Moderation: Used segmentation and detection techniques to precisely detect content unsuitable for brand safety. Implemented models that are state of the art in real-time speed and efficiency, and improved precision and recall by around 15% through active learning. Built an end-to-end pipeline on Kubeflow for training and deployment.
  • Classification-based Visual Content Moderation: Applied multi-label and multi-head classification with self-supervised and supervised training to improve precision by approximately 20%, with the model adapting well to hard samples. Prepared Docker images for each development and deployment environment (containerization).

AI Research Engineer 🧠

D-Meta Co. Ltd | November 2020 - July 2022

  • Slab Text Recognition: Designed and developed a text detection and recognition model to efficiently recognize handwritten text on metal slabs using Spatial Transformer Networks and sequence modeling. Built a complete pipeline from data pre-processing to model training and evaluation. Achieved over 90% accuracy by integrating state-of-the-art detection and recognition models for scene text images.
  • Automatic Number Plate Recognition: Designed and developed an ANPR model to accurately detect and recognize number plates. Leveraged active learning and synthetic image generation techniques to improve precision and recall by around 15%.
  • Car Damage Detection: Built a lightweight damage detection model and deployed it on an Android device using TorchScript. Improved the precision of the model by around 10% by tuning the model parameters.

Research Experience 📚

Research Assistant 🧑‍🔬

AI and SC Lab | Sep 2018 - Nov 2020

  • Computer Vision-based Fire and Smoke Detection: Designed and implemented a dilated CNN architecture for improved feature extraction and recognition in images and videos. Tuned and optimized the model to achieve high accuracy in fire and smoke detection, reducing false positives and running inference 1.5x faster than the fastest counterpart.
  • Model Optimization for Edge Devices: Improved the FPS of the detection model on an edge device (Raspberry Pi 2) through hyperparameter tuning and quantization.

Education 🎓

Institution | Degree | Duration
Gachon University | MSc in Computer Engineering; advised by Prof. Young Im Cho; GPA: 4.01/4.5 | Sep 2018 - Feb 2021
Tashkent University of Information Technologies | BSc in Computer Engineering; GPA(%): 85/100 | Sep 2014 - Jun 2018

Publications 📝

  • Valikhujaev Y, Abdusalomov A, Cho YI. Automatic Fire and Smoke Detection Method for Surveillance Systems Based on Dilated CNNs. Atmosphere, IF 2.9. 2020; 11(11):1241. https://doi.org/10.3390/atmos11111241.
  • Muksimova SH, Valikhujaev Y, Cho YI. Automatic Fire and Smoke Detection System for Open Street CCTV Systems in Smart City Platforms. Korean Society of Information Scientists and Engineers, pp. 412-414, Domestic Conference.

Honors 🏆

  • Best paper award from the Fire Investigation Society of Korea (FISK) (Domestic Conference, 2020)
  • Best presentation award from ISIS2019 & ICBAKE2019 (Domestic Conference, 2019)

Languages 🌐

  • English: Full Professional Proficiency (C1 Advanced)
  • Korean: Limited Working Proficiency (B1 Pre-Intermediate)
  • Uzbek: Native Proficiency
  • Russian: Limited Working Proficiency


Last Updated: 2024-06-20