Avatar

Sadat Shahriyar

PhD Fall'25 aspirant

ML Engineer @ Samsung R&D Institute

If You Are In A Hurry:

  • ◈ Hi! I am Sadat - PhD Fall'25 Aspirant | MLE @ Samsung R&D Institute | CS Graduate @ BUET
  • ◈ Research interest: NLP, Software Engineering, Reliability of Software Systems and Trustworthy AI
  • ◈ 1+ year of professional experience as an ML Engineer; Designed solutions with LLM, VLM, CV models
  • ◈ Contributed in 3 research projects
  • ◈ ML Instructor at Generative AI training conducted at Samsung R&D Institute
  • ◈ CGPA: 3.75/4.00. Recieved Dean's List Scholarship for academic excellence
  • ◈ Recieved Excellence Award for Innovation from Samsung R&D Institute

If You Have Some Time:

▷ Hi! I am Sadat Shahriyar, an ML Engineer at Samsung R&D Institute. I obtained my B.Sc in Computer Science from Department of CSE, Bangaldesh University of Engineering and Technology (BUET). My research interest broadly lies in the field of Natural Language Processing, Software Engineering, Reliability of Software Systems and Trustworthy AI. My current career goal is to pursue a graduate program in my field of interest.

▷ With over a year of experience as an ML engineer at Samsung R&D Institute, I developed solutions using LLMs, VLMs, Transformers, and Vision models in Software Engineering and Testing domain. I contributed to two projects, contributed to 2 patents, worked as an Instructor in Gen AI training and created multiple proof-of-concept tools. For my contribution to research work, I was awarded the Excellence Award for Innovation from Samsung R&D Institute.

▷ Previously I worked as a Graduate Research Assistant at BUET on a collaborative project with Samsung R&D Institute, supervised by Dr. Anindya Iqbal, Sukarna Barua, and Tahmid Hasan. We studied the effectiveness of Large Language Models in generating and executing test cases for Android applications.

▷ For my undergraduate thesis, supervised by Dr. Anindya Iqbal and Dr. Shahrear Iqbal, we studied the efficacy of language models in detecting Android malware using system call sequences.

▷ In my undergraduate life at BUET, I've regularly participated and excelled at multiple competitions including Dhaka AI 2020 and HackNSU 2020. I have taken leadership roles as an ML Instructor for internal Generative AI training at my workplace and during my undergrad, particularly at departmental events like the annual BUET CSE FEST.

▷ In my leisure time, I enjoy travelling, listening to music and binge-watching sitcoms, tv-series and animes.


Professional Experience

Machine Learning Engineer

R&D Product Development Planning team
Samsung R&D Institute
  • ◈ Developing and managing end-to-end Machine Learning systems, focusing on designing solutions, collecting data and fine-tuning LLMs, VLMs and Computer Vision models for enhanced availability of test automation.
  • ◈ Contribute to Patent and ARF Projects, promoting an innovative culture aligned with the company’s core values.
  • ◈ Coordinated and instructed internal Generative AI training, leading a group of 30 employees.
  • ◈ Collaborating with a supportive team to foster growth and enhance engineering capabilities
  • ◈ Simultaneously managed two projects, contributed to two patents, and created multiple proof-of-concept tools.
  • ◈ Recieved Excellence Award for Innovation from Samsung R&D Institute
Generative AI Data Preparation Prompt Engineering RAG Few-shot prompting Multi-Modal System Grounding LLM VLM Tranformer Computer Vision PEFT/LORA Knowledge Distilation

Graduate Research Assistant

Department of Computer Science and Engineering
Bangaldesh University of Engineering and Technology
  • ◈ Analyzed, designed and developed an "Automated Test Case Generation and Execution Tool" for Android mobile applications.
  • ◈ Designed a Custom Depth First Search algorithm to efficiently traverse an application, write test steps in the process and find UI bugs.
  • ◈ Determined the lack of performance in using Appium in our algorithm and designed solutions for performance optimization.
  • ◈ Fine-tuned multiple Machine Learning models for understanding screen representations and understanding unknown elements. Worked with MaxVit, EfficientNet and Flan-t5 in the process.
  • ◈ Designed data collection protocols, trained and supervised data collectors, and ensured data accuracy and completeness.
Algorithm Development Contrastive Learning Data Preparation Prompt Engineering Few-shot prompting LLM Tranformer Computer Vision Software testing Appium Pytorch

Software Engineer

  • ◈ Implemented new features and customised front-end of online payment portal
  • ◈ Analysed codebase for performance optimization
Dotnet Azzure GIT MVC architecture C# MS Sql



Research Experience

LLM Based Automated Test Case Execution

This research project, conducted at Samsung R&D Institute, focused on evaluating the effectiveness of large language models (LLMs) in executing test cases. We developed a framework that includes a robust approach for identifying the relevant UI elements needed to perform actions based on test cases using LLM, followed by interacting with these elements using Selenium. Subsequently, we applied semantic similarity-based textual matching between the final screen resulting from the UI interactions and the expected outcome defined in the test case to generate a verdict. My specific contributions involved simplifying web pages into a more generalized format for LLM comprehension, conducting few-shot prompting experiments with LLMs (such as, Solar-10.7B, StableBeluga-13B, LLaMa-2-7B), preparing dataset, fine-tuning the LLMs (LLaMa-3-8B, Solar-10.7B) model on annotated dataset for UI element identification and generating semantic similarity-based verdicts using all-MiniLM-L6-v2 (Sentence Transformer model), given the final screen and expected result. Our approach successfully executed 106 out of 110 targeted test cases, achieving an accuracy of 97%.

[White Paper]

LLM Generative AI Transformer LLaMa-3-8B Solar-10.7B all-MiniLM-L6-v2 Prompt Engineering Few-shot prompting Data collection PEFT/QLoRa Unsloth

Automated Test Case Creation from UI Guide using Gen AI

This research project, conducted at Samsung R&D Institute, investigates the efficacy of Large Language Models (LLMs), Vision-Language Models (VLMs), and Computer Vision Models in generating test cases from UI guides. We developed a framework that employs a robust method for detecting functional flows within software from UI guides using Vision-Language Models (VLMs), Large Language Models (LLMs), and Computer Vision models, converting these functional flows into test cases through an algorithmic approach. Additionally, we generated expected outcomes for specific functional flows using VLMs. My contributions included conducting few-shot prompting experiments and fine-tuning VLMs (such as CogVLM and Idefics2) for UI screen identification and functional flow mapping. I also performed UI screen matching with their corresponding descriptions using the LLaMa-3-8B model. Our framework effectively generates test cases from extensive UI guides with minimal error and within a reasonable time frame.

LLM VLM Generative AI Transformer Computer Vision CogVLM Idefics2 yoloV8 LLaMa-3-8B Prompt Engineering Few-shot prompting Data collection PEFT/QLoRa

ML Based Automated Android UI testing and Test Case Generation

This project is a collaboration between Samsung R&D Institute and BUET where we studied the effectiveness of Large Language models in generating and executing test cases for android applications under the supervision of Dr. Anindya Iqbal. We developed an automated UI testing tool for Android applications that can significantly reduce manual testing efforts by 80% by automatically generating test cases. The tool utilizes a custom DFS algorithm using Appium to traverse the application's UI and write test steps. Additionally, EfficientNet and MaxViT are employed for image classification of unknown screen elements and screen similarity matching for maintaining DFS state, while the Flan-T5 generative model is used to understand screen representations and generate expected results for each test step.

[White Paper]

DFS Contrastive learning Transformer Computer Vision EfficientNet MaxViT Flan-T5 Data collection Cosine similarity Appium

Android Malware Detection from System Call Sequence using Language Model

Undergraduate thesis project under the supervision of Dr. Anindya Iqbal and Dr. Shahrear Iqbal (Research Officer, National Research Council (NRC) Canada). In this research, we created a framework to efficiently process sequences of system calls, making them more accessible for language models to identify malicious patterns and detect Android malware. We utilized a Transformer models to analyze these sequences for malware detection. We fine-tuned various language models (BERT, RoBERTa, BigBird, LongFormer) with different sequence lengths and techniques such as supervised fine-tuning and contrastive learning. We then compared their performance against existing models like LSTM, Random Forest, and SVM. Our evaluation demonstrated a 6-7% improvement over the previous state-of-the-art model.

[Thesis dissertation]

Transformer Contrastive learning BERT RoBERTa BigBird LongFormer Data preprocessing Malware detection

Code Review Generation Automation using LLM and Preference Optimization

This is a project under the supervision of Dr. Anindya Iqbal where our aim is to produce more human like reviews and automating review generation procedure through large language models. We are working on the latest RL based preference optimization to finetune state of the art open source LLMs and figure out more suitable evaluation metric to find out human likeliness of code review. The knowledge domains we need in this study are: Code review, LLM inference, LLM finetuning and Objective evaluation.

LLM Code Review Preference Optimization Automation



Education

Bangladesh University of Engineering and Technology (BUET)

Bachelor of Science in Computer Science and Engineering
CGPA: 3.75/4.00

Dhaka Residential Model College

Higher Secondary Examination
GPA: 5.00/5.00

Dhaka Residential Model College

Secondary School Certificate
GPA : 5.00/5.00



Projects

 
 
 
 
 

Bangla Grammatical Error Detection and Auto-correction

The project utilized BanglaBERT and BanglaT5 models to detect and correct grammatical errors in Bangla sentences. A synthetic dataset was generated by tagging parts of speech in Bangla sentences scraped from online news portals and swapping relevant words based on their parts of speech. Sequence tagging with BanglaBERT was employed to identify incorrect sentences, while machine translation with BanglaT5 was used to correct and auto-complete them. The approach achieved a 93% accuracy rate in detecting incorrect sentences with BanglaBERT and a SacreBLEU score of 90 for correction and auto-completion using BanglaT5.
[Code]

Transformer BERT T5 Machine Translation Sequence tagging
 
 
 
 
 

Bangla Named Entity Recognition

During the Bangladesh National NLP Hackathon, we developed a project to detect named entities in Bangla sentences using BanglaBERT, Support Vector Machine (SVM), and Conditional Random Fields (CRF). The dataset was augmented with techniques like word swapping, token replacement by label, synonym replacement, random insertion/deletion, stemming, lemmatization, and stopword removal. CRF was enhanced with features such as context words, word suffixes, named entity information, and digit features. The models achieved a macro average F1 score of 0.7938 with BanglaBERT, 0.68 with SVM, and 0.34 with CRF.
[Code]

BERT SVM CRF Data Augmentation
 
 
 
 
 

Vectorized CNN From Scratch

A vectorized version of a Convolutional Neural Network using only numpy without any deeplearning frameworks. Training and testing of the developed model is done on the NumtaDB: Bengali Handwritten Digits dataset. This project is part of the final assignment of CSE 472: Machine Learning Sessional offered by the CSE Department of BUET.
[Code]

Numpy CNN
 
 
 
 
 

Kronolog

This web app, developed for the BUET CSE Fest 2022 Deep Learning Sprint Competition, was designed to collect voice data from participants. It integrated Firebase’s authentication system to manage the login and registration of selected participants. The app successfully collected 10,000 voice data samples, which were later used in the competition and published by Bengali.ai.

Django ReactJS Firebase Data Collection
 
 
 
 
 

Sunset Vacation

This project is a community-based platform for listing and renting local homes, connecting hosts and travelers without owning any properties itself. It features a JWT-based authentication system, a search and recommendation system for travelers to find properties based on location, and Stripe API integration for monetary transactions. The platform also includes a map view using the OpenStreetMap API to display property locations, a chat system for communication between hosts and travelers, and a social Q&A section where users can ask and answer questions about travel destinations and seek suggestions.
[Code] [Demo]

Django rest framework Reactjs PostGreSQL Stripe api Openstreetmap Material UI Authentication system
 
 
 
 
 

Panacea

The project is a website designed for managing hospital operations, featuring a digital appointment and prescription system, an automated billing system, and JWT-based authentication. It includes various dashboards: one for hospital managers to schedule doctors and employees, view analytics like income and patient data; one for patients to track appointments, surgeries, and test results; and one for doctors to manage appointments, surgeries, prescriptions, and patient monitoring. Additionally, it integrates Kafka Stream API for patient condition simulation and provides dashboards for receptionists to handle appointments, payments, and admissions, as well as for technicians to upload test results.
[Code][Demo]

Django rest framework Reactjs Redux Oracle Kafka stream API Material UI Authentication system
 
 
 
 
 

Amphitetris

Developed during HackNSU 2.0, this platform, which earned 2nd runner-up among 20 teams, facilitates communication between garment factory owners and supply agents for purchasing raw materials. Factory owners can invite tenders on a shared platform where vendors bid, automating the supply chain process. Additionally, owners have the option to directly purchase materials from vendors, while vendors can collaborate with each other on the common platform.
[Code]

Django SQLite Authentication system
 
 
 
 
 

C Compiler From Scratch

This software translates a subset of the C programming language into Assembly language. It was developed using C++ to create a symbol table, Lex for lexical analysis, and Bison (YACC) for syntax and semantic analysis.
[Code]

YACC Lex Bison
 
 
 
 
 

GSM Based Home Protection System

This hardware system continuously monitors a home for potential dangers and alerts the user via phone call and message when irregularities are detected. It uses an MQ5 gas sensor to detect LPG gas at critical levels, a PIR sensor for motion detection as part of an anti-theft system, and flame and temperature sensors to detect fire outbreaks. The system incorporates a SIM900A GSM module for alerts and features a VueJS-based website, hosted on Firebase, for real-time monitoring via ThingSpeak.
[Code] [Demo]

Atmel Studio ATMega32 SIM900a GSM Module Sensors Vue.js Thingspeak Firebase



Achievements

  • Finalist, Bangladesh National NLP Hackathon hosted by Bangladesh Open Source Network (2023)
  • Finalist, Bangladesh Bangladesh Blockchain Olympiad (2022)
  • 2nd Runner up, HackNSU 2.0 (2020)
  • Finalist, DhakaAI (2020)
  • 9th position, Programming Contest, BUET CSE FEST (2019)
  • Dean's award for academinc excellence, Faculty of Electrical and Electronic Engineering, BUET (2019-2021)



Leadership Experience

ML Instructor

Generative AI Training @ Samsung R&D Institute
I organized internal Generative AI training at Samsung R&D Institute with 30 selected employees to enhance their skills in the Generative AI domain. My role included coordinating lecture assignments with other trainers and leading individual training sessions. Notable sessions I conducted included topics like Introduction to Deep Learning and ANN, Image Processing with CNN, Introduction to LLM, Prompt Engineering, and Fine-tuning LLMs. Our efforts were well-received, earning recognition from both the Managing Director of Samsung R&D Institute, Bangladesh and Korean Headquarters.

Organizer

Organized the first ever deep learning competition to be hosted in BUET CSE Festivals. My responsibilities ranged from setting up environment to collect dataset for the competition to training and supervising data collectors and ensuring data accuracy and completeness. We partnered with Bengali.AI and hosted this competiton in Kaggle. The competition was a resounding success with 122 competitors comprising of 59 teams and 470 submissions. Kaggle recognized our efforts by providing us the Kaggle Community Competition Creator Prize

Committee Member

DRMC Science Club
Played a key role in organizing the Science Fair 2016 at Dhaka Residential Model College.

Student Tutor

Tutored many students ranging from middle school to high school and undergrad-level students during my undergraduate years.



Technical Skills

 
 
 
 
 

Programming Languages

Python C C++ C# Java Javascript SQL Bash
 
 
 
 
 

Machine Learning Libraries

Pytorch Tensorflow Keras Scikit-learn Numpy Pandas OpenAI Langchain
 
 
 
 
 

Machine Learning Tools

Text Generation Web UI Label Studio
 
 
 
 
 

Frameworks

Django Nodejs Flask Fastapi Reactjs Bootstrap Git
 
 
 
 
 

Database

Oracle PostgreSQL MySQL MsSQL
 
 
 
 
 

Platforms

Firebase PythonAnywhere Stripe OpenStreetMap Github BitBucket
 
 
 
 
 

Security Tools

Zap Wireshark
 
 
 
 
 

Graphic designing

Figma Canva Draw.io
 
 
 
 
 

Miscellaneous

Latex Git Microsoft Office
 
 
 
 
 

Soft Skills

Team management Problem solving Requirement analysis Public speaking



Contact