Agentic Ops · Model Evaluation · RLVR Datasets

Md Fahim Arefin

Technical Manager, AppliedAI at Pareto

I build RLVR training datasets and RL environments that frontier AI labs use to train and evaluate their models, grounded in eight years of research in data mining and LLM evaluation.

Portrait of Md Fahim Arefin
15Peer-reviewed publications
3Funded research grants
8+Years across AI industry & academia
99Citations on Google Scholar

About

Fahim Arefin works at the intersection of frontier-model training and computer science research. At Pareto, he devises the strategy and groundwork for off-the-shelf RLVR training datasets and RL environments for frontier labs, and builds agentic pipelines that automate delivery operations end to end: intake, QA, verification, and shipping.

Before Pareto, he spent three years at Turing leading delivery programs for LLM training-data engagements with frontier AI labs across code, reasoning, and STEM domains. In parallel, he serves as Lecturer in Computer Science and Engineering at the University of Dhaka, where he teaches, advises students, and supervises research on data mining, LLM capabilities, and AI in education.

His research spans sequential pattern mining, LLM evaluation for competitive programming, and applied machine learning, published in venues including ACM TOSEM, Expert Systems with Applications, PAKDD, and ICSE workshops. He co-founded AlterYouth, a C2C scholarship platform funding 2,000+ monthly scholarships for underprivileged students in Bangladesh.

Experience

Oct 2025 – Present
Current

Technical Manager, AppliedAI

Pareto.AI

Devising strategy and laying the groundwork for off-the-shelf RLVR training datasets and RL environments for frontier labs. Building agentic pipelines that automate delivery ops end to end: intake, QA, verification, and shipping.

Read: Verifier engineering is the moat ↗
Mar 2023 – Present
Current

Lecturer, Computer Science & Engineering

University of Dhaka

Teaching CS courses and supervising research on data mining, LLM capabilities, and AI in education. Student Advisor for the department since Nov 2024. Previously Part-time Lecturer (Jul 2022 – Mar 2023).

Oct 2022 – Sep 2025

Program Delivery Manager

Turing.com

Held full P&L accountability for concurrent LLM training-data programs with frontier AI labs, maintaining 100% client retention while expanding gross margin. Scaled campaigns from 0 to 500+ trainers generating $4M in monthly revenue across code, reasoning, and STEM domains. Designed eval datasets that exposed model failure modes, with outcomes guiding clients' successive fine-tuning cycles.

Read: Closing note on my Turing chapter ↗
Jun 2019 – Present
Current

Co-founder

AlterYouth

C2C scholarship platform enabling anyone in the world to fund the education of underprivileged students in Bangladesh through digital banking. Funds 2,000+ scholarships every month, disbursing 35M+ BDT in scholarships annually.

Start a scholarship ↗
Feb 2018 – May 2019

Software Engineer

Beetles Cyber Security

Developed CrowdSpark, a proprietary engagement-management application to control, organize, and visualize the pentesting process.

CrowdSpark ↗

Research & Publications

Full citation record on Google Scholar ↗

Research Projects & Grants

2026
Consultant · RAG-Driven Business Intelligence Platform for Real-time, Predictive and Prescriptive Decision AnalyticsICSETEP, University Grants Commission
2025
Domain Expert · Enhancement of Bangla Language in ICT (EBLICT) through Research and DevelopmentBangladesh Computer Council
2023–24
Co-Principal Investigator · Utility-based Hypergraph MiningUGC Research Grant

Skills & Expertise

Frontier AI Training Data

RLVR dataset strategy, RL environment design, verifier engineering, rubric and QA systems, expert pipeline scaling for code, reasoning, and STEM domains.

Agentic Systems

Agentic pipeline architecture for delivery operations: automated intake, evaluation, verification, and shipping workflows.

Research

Sequential pattern mining, LLM evaluation methodology, sentiment analysis, applied machine learning; 15 peer-reviewed publications.

Leadership & Teaching

Program delivery for frontier-lab engagements, cross-functional team management, university teaching, research supervision, student advising.

Education & Certifications

2019
M.Sc. in Computer Science & EngineeringUniversity of Dhaka
2017
B.Sc. in Computer Science & EngineeringUniversity of Dhaka · Gold Medal for Academic Excellence, 51st Convocation
2025
Foundation Certificate in University Teaching & Learning (FCUTL)Institutional Quality Assurance Cell, University of Dhaka
2020
Certificate in Teaching & Learning (CTL)Center of Excellence for Teaching & Learning, Green University of Bangladesh

Honors & Awards

2024
Judge Coordinator & Problem Setter · Code Samurai HackathonJICA · BJIT · University of Dhaka
2023
Co-coach, DU_NotStrongEnough · ICPC World FinalsICPC · Egypt
2022
Judge Coordinator & Problem Setter · Code Samurai HackathonJICA · BJIT · University of Dhaka
2018
Gold Medal for Academic Excellence · 51st Convocation, B.Sc.University of Dhaka
2017
Dean's Award · Faculty of Engineering and TechnologyUniversity of Dhaka
2017
First Runner-Up, Health Category · Prothom Alo Apps ContestEATL · Bangladesh
2016
Best Student Software · Code Healthy with OpenShiftRed Hat · Global
2016
Best Mobile Application · VS Marketplace HackfestMicrosoft · Global

Voluntary Services

2024–
Moderator · CSEDU Students' ClubUniversity of Dhaka · Guiding club activities and student initiatives
2017–19
President · CSEDU Students' ClubUniversity of Dhaka · Led student activities for the CSE department
2012–13
General Secretary · Notre Dame English ClubDhaka · Organised literary events and managed club operations

Get in Touch

Interested in RL environments, training data partnerships, research collaboration, or teaching? Always happy to connect.