Current Projects
New York University
Incremental Skill Discovery for RL Agents
I’m working in the CILVR Lab with Prof. Lerrel Pinto on expanding
Citation: Shafiullah et. al., (2022)
Shafiullah,
Nur Muhammad & Lerrel
Pinto
(2022).
One After Another: Learning Incremental Skills for a Changing World.
arXiv preprint arXiv:2203.11176.
https://arxiv.org/abs/2203.11176
’s incremental skill discovery framework to manipulation environments like AllegroHand.
New York University
Speech Recognition for Mandarin Conversations
I’m working with Prof. Michael Picheny on DARPA’s CCU Program to improve speech recognition systems for conversational Mandarin, primarily on the AISHELL4 dataset
(Citation: Fu et. al., 2021
Fu,
Yihui,
Luyao
Cheng,
Shubo
Lv,
Yukai
Jv,
Yuxiang
Kong,
Zhuo
Chen,
Yanxin
Hu,
Lei
Xie,
Jian
Wu,
Hui
Bu &
others
(2021).
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario.
arXiv preprint arXiv:2104.03603.
https://arxiv.org/abs/2104.03603
).
Course Projects
New York University
Contrastive Training with Masked Autoencoders
[Report]
Implemented a contrastive loss on Masked Autoencoders
(Citation: He et. al., 2021
He,
Kaiming,
Xinlei
Chen,
Saining
Xie,
Yanghao
Li,
Piotr
Dollár & Ross
Girshick
(2021).
Masked Autoencoders Are Scalable Vision Learners.
arXiv preprint arXiv:2111.06377.
https://arxiv.org/abs/2111.06377
)
Past Projects
Flixstock India
Bust Replacement on Photorealistic Model Images
I worked with StyleGANs to generate bust-level crops of photorealistic model images, making use of techniques like GAN inversion and style mixing. I also used classical methods like MLS
(Citation: Schaefer et. al., 2006
Schaefer,
Scott,
Travis
McPhail & Joe
Warren
(2006).
Image Deformation Using Moving Least Squares. In
ACM SIGGRAPH 2006 Papers. (pp. 533–540).
https://dl.acm.org/doi/10.1145/1179352.1141920
) to create an end-to-end pipeline for bust replacement upto 1024px.
IBM Research
Neural Conversational Question Answering
I worked with Danish Contractor on improving question answering systems on the ShARC dataset
(Citation: Saeidi et. al., 2018
Saeidi,
Marzieh,
Max
Bartolo,
Patrick
Lewis,
Sameer
Singh,
Tim
Rocktäschel,
Mike
Sheldon,
Guillaume
Bouchard & Sebastian
Riedel
(2018).
Interpretation of Natural Language Rules in Conversational Machine Reading.
arXiv preprint arXiv:1809.01494.
https://arxiv.org/abs/1809.01494
). This led to a patent (US 16/892805), and a paper
(Citation: Verma et. al., 2020
Verma,
Nikhil,
Abhishek
Sharma,
Dhiraj
Madan,
Danish
Contractor,
Harshit
Kumar & Sachindra
Joshi
(2020).
Neural Conversational QA: Learning to Reason vs Exploiting Patterns.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP).
https://doi.org/10.18653/v1/2020.emnlp-main.589
).
Even Further
Microsoft Research
Code to Natural Language
[Report]
I worked with Navin Goyal on techniques to describe code (SQL Queries and bash
commands in particular) in natural language as part of my undergrad thesis.
IBM Research
Question Generation
[Report]
I worked on improving techniques to generate answerable questions from text.