I am a postdoctoral researcher at CUHK T Stone Robotics Institute, where I work on building machines that understand the real world from videos with minimal supervision.
I am interested in learning representation from videos (along with audio as free resources) with minimal supervision. Recently, I also develop an interest in fundamental machine learning theory.
This work is an extension of our CVPR 2019 paper. I recommend the ablation study section. Our approach achieved decent performance after just one training epoch.
Line correspondences are mapped into vectors tangent to sphere. Neighboring vectors mapped from inliers exhibit a local trend consistency (analogous to “a school of fish”).
This work is inspired by the observation that human visual system is sensitive to video pace, e.g., slow motion, a widely used technique in film making.
Temporal encoded kinematics features are proposed for action recognition, which compute the linear velocity and orientation displacement based on human skeleton data.
This is an editor invitation paper. A brief review is presented to introduce our recent works on machine intelligence for real-world applications of robots. One technology leads to a startup company VisionNav.
A caring robot is developed to detect dangerous behavior of children in the domestic environment based on action recognition and object recognition technologies.
Before PhD
During my undergraduate study, I was lucky enough to work with Prof. Wei Li on a defense–intrusion interaction optimization problem. This work is published in a Tier 1 applied mathematics journal.
An optimal interception strategy of the defender is provided with interpretations of its physical meaning, which depends on relative mobility of the intruder and defender.