
Hello, my name is Jinrong Zhang

👋 Hello, it's a happy day~

  • My name is Jinrong Zhang, and I am a researcher specializing in computer vision and multimodal large models. 🚀
  • If you are interested in collaboration or academic exchange, please feel free to contact me.

🧑‍💻 About Me

📚 PhD Student in Electronic Information at Harbin Institute of Technology, Shenzhen.
🔬 Research Interests:

  • Video Understanding and Generation
  • Multimodal Representation
  • Temporal Action Segmentation

📄 Research Papers

I love publishing and sharing my findings with the world! Here's a list of some of my published research papers:

  1. Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm – AAAI, CCF-A, 2025

    🔗 Link to original paper

  2. End-to-End Streaming Video Temporal Action Segmentation with Reinforce Learning – TNNLS, CCF-B, IF=10.2, 2025

    🔗 Link to original paper

  3. Flexible Streaming Temporal Action Segmentation with Diffusion Models – ICME, CCF-B, 2025

    🔗 Accepted (to be indexed soon)

  4. DTOS: Dynamic Time Object Sensing with Large Multimodal Model – CVPR, CCF-A, 2025

    🔗 Accepted (to be indexed soon)

  5. Cluster-Refined Optimal Transport for Unsupervised Action Segmentation – ICASSP, CCF-B, 2025

    🔗 Link to original paper

  6. Unsupervised Temporal Action Segmentation Based on Wavelet Feature Processing – IJCNN, CCF-C, 2025

    🔗 Accepted (to be indexed soon)

On the Papers page, you can also access the key details of these research papers.


💼 Internship Experience

  • Xiaomi AI Lab – AI Research Intern
    2024/2 – 2025/10
    • I developed a large-model solution for access permission detection at the Xiaomi car factory and deployed it successfully.
    • During my internship, I published a paper at AAAI.

🌐 My Profile