Research
My research lies at the broader area of computer vision and machine learning, with a particular focus on data- and computation-efficient deep learning models and their applications in computer vision. In the long term, my goal is to develop intelligent systems that can learn from, adapt to, and interact with the environment, and autonomously perform complex tasks in the real world.
My work includes an adaptive downsampling method [1] that selectively resamples pixels to accelerate inference without sacrificing accuracy, and a spatiotemporal state space model for prostate cancer detection in mpMRI [2]; open-vocabulary segmentation with vision langeuage models (VLMs) [3], efficient image enhancement with neural implicit representation [4] and diffusion models [5], semantic line detection from images with Hough representations [6], palmprint recognition with purely synthetic training data [7]. and token pruning of visual langeuage models (VLMs).
References
This page is written in MDX and the citations are automatically rendered from publications.bib using the rehype-citation plugin.