Artificial Intelligence & Medical Image Analysis Lab
The Group Artificial Intelligence and Medical Image Analysis (AIMIA) belongs to the Digital Media Research Center of Peking University, guided by Dr. Chen Jie, the associate professor of the School of Electronic and Computer Engineering of Peking University.
AIMIA has established cooperative relationships with other research groups, including international research groups, in the Digital Media Research Center of Peking University, as well as in Pengcheng Laboratory and Shenzhen Bay Laboratory.
Our research focuses on Multimodal Large Language Model (MLLM), Artificial Intelligence-Generated Content (AIGC), Embodied AI and AI for science.
Multimodal Large Language Models (MLLM) are advanced artificial intelligence models designed to understand and process multiple modalities, including text, images, audio, etc. By applying deep learning techniques and large sets of data for training, these models are able to figure out patterns and relationships between models.
Application Area 1: Natural Language Processing
- NLP model structure innovation
- Learning Feedback Enhancement
- Distributed Training
- Model Optimization
- Other related technologies.
Application Area 2: Computer Vision and Pattern Recognition
- Image Segmentation and Classification
- Image Semantic Understanding
- Semi-supervised Learning
- Weakly Supervised Learning, etc.
Application Area 3: Multimodal and Multi-perspective Learning
Explore how to combine multiple sources of information and learn information in datasets from multiple perspectives. These applications include graphic retrieval, 3D object detection, machine translation, etc.
Application Area 4: Medical Image Analysis
The purpose of Medical Image Analysis is to detect the interest areas and targets in medical images qualitatively or quantitatively, and to explore the connotation of images, so as to provide information reference for clinicians or researchers. The main research fields include image classification and segmentation, disease diagnosis, image registration, medical report generation, etc.
AIGC aims at using artificial intelligence algorithms to generate content with certain creativity and quality. It applies model training and data learning to generate content based on input criteria and guidance. For instance, if we input keywords, descriptions or samples, AIGC will generate articles, images, audio, etc. accordingly.
Embodied intelligence is a subfield of AI research. It focuses on enabling intelligent systems to perceive, understand, and engage with their environment through actual interaction. Embodied intelligence aims to integrate sensors, action mechanisms, and decision-making capabilities to enable intelligent systems to act and interact with the physical world as humans do.
By blending perception, cognition, and behavior, embodied intelligence models are able to learn the structure and rules of the environment and take appropriate actions. This comprehensive approach allows embodied intelligence to be applied to robotics, autonomous vehicles, augmented reality and more, facilitating closer interaction and cooperation between intelligent systems and the human environment.
AI for Science means pre-training models with strong generalization capabilities, thus enabling them to tackle multiple downstream tasks related to major bottlenecks in science, which include controllable targeted peptide design, virus mutation prediction, antibody generation and key property prediction, protein structure prediction, RNA structure and function prediction, etc.
Research interest: Computer vision and pattern recognition; AI4Science; Natural language processing; Medical image analysis.
Email:jiechen2019 at pku dot edu dot cn
Deep Learning, AI for Science, Bioinformatics
Semi-supervised Learning,
Computer Vision
Segmentation, Cross-modal, Deep Learning
Computer Vision,
Image Segmentation
Visual Question Answering,
Scene-Text based VQA
Text-Video Retrieval
Computer Vision
Image Restoration,
Super-resolution
Computer Vision,
Image Segmentation
Detection and Segmentation
Deep Learning,
Semi-supervised Learning
Computer Vision,
Semi-supervised Learning
Computer Vision,
Medical Image Analysis
Bioinformatics,
AI for Science
Bioinformatics,
AI for Science
Medical Report Generation
Computer Vision,
Image Segmentation
Multi-modal,
Deep Learning
Computer Vision,
AIGC
Object Detection,
Image Segmentation
Deep learning
Deep learning,
NLP
> Finalist for the 2022 "Gordon Bell New Crown Special Award" (The finalist only contains three teams in the world. AIMIA is the only Chinese team in the finalist.)
> First place in CAMEO 2022, an internationally renowned protein structure prediction competition
> The second Prize of National Science and Technology Progress Award in 2015: Local Modeling of Visual Patterns and Nonlinear Feature Acquisition Theory and Method
> The second prize of National Science and Technology Progress Award in 2005: Face Recognition Theory, Technology, System and Application
> The first prize of the Face verification competition of the First China Biometrics Competition (BVC2004)
> First place in the IAPR ICB 2006 Face Verification Competition, organized by Josef Kittler of the University of Surrey, UK
We welcome undergraduates and graduate students who are interested in scientific research and have strong ability to withstand pressure to apply for master’s degree and doctor’s degree in the School of Electronic and Computer Engineering of Peking University to engage in Computer Vision /AI for Science/ AIGC related research. (Click here for admission information)
jiechen2019 at pku dot edu dot cn
lijx at pkusz dot edu dot cn
Peking University Shenzhen Graduate School
No. 2199 Lishui Road, Nanshan District, Shenzhen
AI Website Software