Vinay K. Verma

AI Researcher and Computer Vision Engineer with 6+ years of experience in building scalable vision systems across industry and academia. Currently focused on learning structured world representations through semantic scene graphs and multimodal encoders for downstream tasks such as autonomous systems and healthcare applications.

Blog LinkedIn GitHub Resume PDF

Work Experience

HRI Lab @ IIIT Delhi

Researcher | Aug 2025 - Present

Working on multi-robot perception and representation learning using semantic and geometric scene understanding. Designing graph-based world models from multi-view camera inputs for robot localization, calibration, and interaction.

Visual Conception Group @ IIIT Delhi

Research Assistant | Oct 2023 - Jul 2024

Developed segmentation algorithms for large satellite and volumetric medical datasets on Jetson edge platforms. Presented at ICASSP 2025.

Stats Perform, London (Remote)

CV Engineer 2 | Nov 2022 - Dec 2023

Improved player tracking accuracy from 85% to 98% using temporal smoothing. Designed robust Jersey OCR (TensorFlow) with 90%+ accuracy.

Fynd Shopsense Retail & Wobot

ML Engineer & Sr. CV Engineer | 2019 - 2022

Built CV APIs handling 2.5M+ req/day. Designed multi-camera vehicle tracking for 300+ CCTV cams. Built lighting-invariant color detection.

Projects & Systems

Robotics & Manipulation

Designed a sensorless force estimation model utilizing joint currents for the uFactory Lite6. Engineered autonomous tabletop object sorting and AprilTag camera calibration pipelines.

Generative AI & Healthcare

Built ResearchBuddy, a RAG-Based QA pipeline using Gemini-2.5 Flash and LangChain. Developed OPD Assistant Medical Record System with PGIMER to convert unstructured raw data to meaningful records.

Publications & Patents

Towards Falsifiable Grammars of HRI

ACM HRI 2026, Edinburgh, UK

Resource-Efficient Perception for Vision Systems

ICASSP 2025, Hyderabad, India

Toward Falsifiable Grammars for NeuroDesign in HRI

NeuroHRI 2025

Foundation Models for 3D Biomedical Image Segmentation

CVPR Workshop MedSegFM 2025

Patent: A Method to Train Deep Learning Model

Indian Patent App. No. 202411050462

Education

PhD, Human Robot Interaction

IIIT Delhi | 2024 - 2029 (CGPA: 8.40)

M.Tech Research (CSE)

IIIT Delhi | 2024 - 2026 (CGPA: 8.40)

PG Diploma (Data Science & AI)

IIIT Delhi | 2022 - 2023 (CGPA: 9.17)

B.Tech (ECE)

ADGIPS (GGSIPU), Delhi | 2016 - 2020 (CGPA: 7.79)
Press W, A, S, D, or SPACE to Enter 2D Sandbox
Node_ID: n_001 Class: Entity.Person

Vinay K. Verma

AI Researcher | Computer Vision

Focused on learning structured world representations through semantic scene graphs and multimodal encoders.

PythonC++PyTorchROS2TensorFlowKubernetes
[ ACCESS LINKEDIN ]
T_Start: 2019 T_End: 2023 Class: Event.Experience

Industry Applied AI

Stats Perform

CV Engineer 2 | Nov '22 - Dec '23

Improved player tracking accuracy from 85% to 98% via temporal smoothing. Designed robust Jersey OCR (TensorFlow).

Fynd & Wobot Intelligence

ML Engineer / Sr. CV Engineer | 2019 - 2022

Built CV APIs handling 2.5M+ req/day. Designed multi-camera vehicle tracking for 300+ CCTV cams.

T_Start: 2023 T_End: Present Class: Event.Experience

Academic Research Labs

HRI Lab @ IIIT Delhi

Researcher | Aug '25 - Present

Designing graph-based world models from multi-view camera inputs for robot localization, calibration, and interaction.

Visual Conception Group

Research Asst. | Oct '23 - Jul '24

Developed segmentation algorithms for large satellite and volumetric medical datasets on Jetson edge platforms.

Class: Data.Systems Domain: Robotics & LLMs

Engineering Systems

Robotics & Manipulation

Sensorless Force Estimation (Lite6), Autonomous Tabletop Object Sorting, AprilTag Calibration Pipeline, ROS2 & Gazebo Sims.

Generative AI & Health

ResearchBuddy: RAG-Based QA using Gemini-2.5 Flash. OPD Assistant Medical Record System built with PGIMER.

Class: Data.Publications Count: 5+

Research Outputs

Towards Falsifiable Grammars of HRI

ACM HRI 2026, Edinburgh. Transforms HRI from descriptive analysis to a priori verification.

Resource-Efficient Perception

ICASSP 2025. Optimized computer vision for edge systems under resource constraints.

Patents & Foundations

NeuroHRI 2025. CVPR MedSegFM 2025 (Ranked 4th/210). Indian Patent App (202411050462) for DL Training Methods.

[ VIEW PAPER ]
T_Start: 2016 T_End: 2023 Class: Relation.Studied_At

Academic Foundations

PG Diploma (Data Science & AI)

IIIT Delhi | 2022 - 2023

B.Tech (ECE)

ADGIPS (GGSIPU) | 2016 - 2020
T_Start: 2024 T_End: 2029 Class: Relation.Studied_At

Advanced Academia

PhD, Human Robot Interaction

IIIT Delhi | 2024 - 2029

Department of CSE (Human Machine Interaction Lab).

M.Tech Research (CSE)

IIIT Delhi | 2024 - 2026

Thesis: Localized Perception for Constrained Vision Systems.

Spatio-Temporal Telemetry
TEMPUS (X) 2026.0
DOMAIN (Y) 0000.0
VELOCITY 00.0
SCORE 0
LIVES 5
WEAPON STD
AUTO-TRAVERSE OFF
Agent Controls
Navigate A / D / Steer
Accelerate W / Mouse Hold
Fire Gun SPACE
Auto-Traverse E / Q
Inspect Node ENTER
Press ENTER to Inspect Node