2025
-
Zhiying Lu, Chuanbin Liu, Xiaojun Chang, Yongdong Zhang, Hongtao Xie: DHVT: Dynamic Hybrid Vision Transformer for Small Dataset Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 47(4): 2615-2631 (2025)
-
Jiaming Li, Lingyun Yu, Runxin Liu, Hongtao Xie: A Detail-Aware Transformer to Generalizable Face Forgery Detection. IEEE Trans. Circuits Syst. Video Technol. 35(4): 3262-3275 (2025)
-
Lingyun Yu, Tian Xie, Chuanbin Liu, Guoqing Jin, Zhiguo Ding, Hongtao Xie: Distilling Multi-Level Semantic Cues Across Multi-Modalities for Face Forgery Detection. IEEE Trans. Circuits Syst. Video Technol. 35(5): 4698-4712 (2025)
-
Sun'ao Liu, Hongtao Xie, Jiannan Ge, Yongdong Zhang: ReferSAM: Unleashing Segment Anything Model for Referring Image Segmentation. IEEE Trans. Circuits Syst. Video Technol. 35(5): 4910-4922 (2025)
-
Jiannan Ge, Zhihang Liu, Pandeng Li, Lingxi Xie, Yongdong Zhang, Qi Tian, Hongtao Xie: Denoised and Dynamic Alignment Enhancement for Zero-Shot Learning. IEEE Trans. Image Process. 34: 1501-1515 (2025)
-
Yixuan Zhang, Chuanbin Liu, Yizhi Liu, Yifan Gao, Zhiying Lu, Hongtao Xie, Yongdong Zhang: Leveraging Concise Concepts With Probabilistic Modeling for Interpretable Visual Recognition. IEEE Trans. Multim. 27: 3117-3131 (2025)
-
Fengyuan Liu, Lingyun Yu, Quanwei Yang, Meng Shao, Hongtao Xie: High Fidelity Face Swapping via Facial Texture and Structure Consistency Mining. IEEE Trans. Multim. 27: 6168-6181 (2025)
-
Runxin Liu, Tian Xie, Jiaming Li, Lingyun Yu, Hongtao Xie: IDseq: Decoupled and Sequentially Detecting and Grounding Multi-Modal Media Manipulation. AAAI 2025: 496-504
-
Yifan Gao, Zihang Lin, Chuanbin Liu, Min Zhou, Tiezheng Ge, Bo Zheng, Hongtao Xie: PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering. CVPR 2025: 8083-8093
-
Zhihang Liu, Chen-Wei Xie, Pandeng Li, Liming Zhao, Longxiang Tang, Yun Zheng, Chuanbin Liu, Hongtao Xie: Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models. CVPR 2025: 8568-8578
-
Tianhao Qi, Jianlong Yuan, Wanquan Feng, Shancheng Fang, Jiawei Liu, SiYu Zhou, Qian He, Hongtao Xie, Yongdong Zhang: Mask^2DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation. CVPR 2025: 18837-18846
-
Bangbang Zhou, Zuan Gao, Zixiao Wang, Boqiang Zhang, Yuxin Wang, Zhineng Chen, Hongtao Xie: SynTab-LLaVA: Enhancing Multimodal Table Understanding with Decoupled Synthesis. CVPR 2025: 24796-24806
-
Yaqi Cai, Shancheng Fang, Yadong Qu, Xiaorui Wang, Meng Shao, Hongtao Xie: IterMeme: Expert-Guided Multimodal LLM for Interactive Meme Creation with Layout-Aware Generation. IJCAI 2025: 720-728
2024
-
Quanwei Yang, Lingyun Yu, Fengyuan Liu, Yun Song, Meng Shao, Guoqing Jin, Hongtao Xie: Symmetrical Siamese Network for pose-guided person synthesis. Comput. Vis. Image Underst. 248: 104134 (2024)
-
Li Wang, Lingyun Yu, Yongdong Zhang, Hongtao Xie: Generalizable Speech Spoofing Detection Against Silence Trimming With Data Augmentation and Multi-Task Meta-Learning. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3296-3310 (2024)
-
Mingqi Fang, Lingyun Yu, Hongtao Xie, Qingfeng Tan, Zhiyuan Tan, Amir Hussain, Zezheng Wang, Jiahong Li, Zhihong Tian: STIDNet: Identity-Aware Face Forgery Detection With Spatiotemporal Knowledge Distillation. IEEE Trans. Comput. Soc. Syst. 11(4): 5354-5366 (2024)
-
Zixiao Wang, Hongtao Xie, Yuxin Wang, Hai Xu, Guoqing Jin: DCFP: Distribution Calibrated Filter Pruning for Lightweight and Accurate Long-Tail Semantic Segmentation. IEEE Trans. Circuits Syst. Video Technol. 34(7): 6063-6076 (2024)
-
Peiqi Jiang, Hongtao Xie, Lingyun Yu, Guoqing Jin, Yongdong Zhang: Exploring Bi-Level Inconsistency via Blended Images for Generalizable Face Forgery Detection. IEEE Trans. Inf. Forensics Secur. 19: 6573-6588 (2024)
-
Tianhao Qi, Hongtao Xie, Pandeng Li, Jiannan Ge, Yongdong Zhang: Balanced Classification: A Unified Framework for Long-Tailed Object Detection. IEEE Trans. Multim. 26: 3088-3101 (2024)
-
Hongtao Xie, Yan Jiang, Lei Zhang, Pandeng Li, Dongming Zhang, Yongdong Zhang: Semantic-Enhanced Proxy-Guided Hashing for Long-Tailed Image Retrieval. IEEE Trans. Multim. 26: 9499-9514 (2024)
-
Jiannan Ge, Hongtao Xie, Pandeng Li, Lingxi Xie, Shaobo Min, Yongdong Zhang: Towards Discriminative Feature Generation for Generalized Zero-Shot Learning. IEEE Trans. Multim. 26: 10514-10529 (2024)
-
Mingqi Fang, Lingyun Yu, Yun Song, Yongdong Zhang, Hongtao Xie: IEIRNet: Inconsistency Exploiting Based Identity Rectification for Face Forgery Detection. IEEE Trans. Multim. 26: 11232-11245 (2024)
-
Zhihang Liu, Jun Li, Hongtao Xie, Pandeng Li, Jiannan Ge, Sun'ao Liu, Guoqing Jin: Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval. AAAI 2024: 3855-3863
-
Tianhao Qi, Shancheng Fang, Yanze Wu, Hongtao Xie, Jiawei Liu, Lang Chen, Qian He, Yongdong Zhang: DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations. CVPR 2024: 8693-8702
-
Yuhao Sun, Lingyun Yu, Hongtao Xie, Jiaming Li, Yongdong Zhang: DiffAM: Diffusion-Based Adversarial Makeup Transfer for Facial Privacy Protection. CVPR 2024: 24584-24594
-
Jianjun Xu, Yuxin Wang, Hongtao Xie, Yongdong Zhang: OTE: Exploring Accurate Scene Text Recognition Using One Token. CVPR 2024: 28327-28336
-
Boqiang Zhang, Hongtao Xie, Zuan Gao, Yuxin Wang: Choose What You Need: Disentangled Representation Learning for Scene Text Recognition, Removal and Editing. CVPR 2024: 28358-28368
-
Jiannan Ge, Lingxi Xie, Hongtao Xie, Pandeng Li, Xiaopeng Zhang, Yong-Dong Zhang, Qi Tian: AlignZeg: Mitigating Objective Misalignment for Zero-Shot Semantic Segmentation. ECCV (43) 2024: 142-161
-
Zixiao Wang, Hongtao Xie, Yuxin Wang, Yadong Qu, Fengjun Guo, Pengwei Liu: Leveraging Text Localization for Scene Text Removal via Text-Aware Masked Image Modeling. ECCV (66) 2024: 357-373
-
Zuan Gao, Yuxin Wang, Yadong Qu, Boqiang Zhang, Zixiao Wang, Jianjun Xu, Hongtao Xie: Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition. IJCAI 2024: 767-775
-
Bangbang Zhou, Yadong Qu, Zixiao Wang, Zicheng Li, Boqiang Zhang, Hongtao Xie: Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition. IJCAI 2024: 1762-1770
-
Yiding Li, Lingyun Yu, Li Wang, Hongtao Xie: Control-Talker: A Rapid-Customization Talking Head Generation Method for Multi-Condition Control and High-Texture Enhancement. ACM Multimedia 2024: 3519-3527
-
Yadong Qu, Yuxin Wang, Bangbang Zhou, Zixiao Wang, Hongtao Xie, Yongdong Zhang: Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing. NeurIPS 2024
-
Quanwei Yang, Jiazhi Guan, Kaisiyuan Wang, Lingyun Yu, Wenqing Chu, Hang Zhou, ZhiQiang Feng, Haocheng Feng, Errui Ding, Jingdong Wang, Hongtao Xie: ShowMaker: Creating High-Fidelity 2D Human Video via Fine-Grained Diffusion Modeling. NeurIPS 2024
-
Boqiang Zhang, Zuan Gao, Yadong Qu, Hongtao Xie: How Control Information Influences Multilingual Text Image Generation and Editing? NeurIPS 2024
2023
-
Shancheng Fang, Zhendong Mao, Hongtao Xie, Yuxin Wang, Chenggang Yan, Yongdong Zhang: ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting. IEEE Trans. Pattern Anal. Mach. Intell. 45(6): 7123-7141 (2023)
-
Pandeng Li, Hongtao Xie, Yan Jiang, Jiannan Ge, Yongdong Zhang: Neighborhood-Adaptive Multi-Cluster Ranking for Deep Metric Learning. IEEE Trans. Circuits Syst. Video Technol. 33(4): 1952-1965 (2023)
-
Yuxin Wang, Hongtao Xie, Zixiao Wang, Yadong Qu, Yongdong Zhang: What is the Real Need for Scene Text Removal? Exploring the Background Integrity and Erasure Exhaustivity Properties. IEEE Trans. Image Process. 32: 4567-4580 (2023)
-
Fanchao Lin, Zhaofan Qiu, Chuanbin Liu, Ting Yao, Hongtao Xie, Yongdong Zhang: Prototypical Matching Networks for Video Object Segmentation. IEEE Trans. Image Process. 32: 5623-5636 (2023)
-
Jiaming Li, Hongtao Xie, Lingyun Yu, Xingyu Gao, Yongdong Zhang: Discriminative Feature Mining Based on Frequency Information and Metric Learning for Face Forgery Detection. IEEE Trans. Knowl. Data Eng. 35(12): 12167-12180 (2023)
-
Lingfeng Ma, Hongtao Xie, Chuanbin Liu, Yongdong Zhang: Learning Cross-Channel Representations for Semantic Segmentation. IEEE Trans. Multim. 25: 2774-2787 (2023)
-
Yadong Qu, Hongtao Xie, Shancheng Fang, Yuxin Wang, Yongdong Zhang: ADNet: Rethinking the Shrunk Polygon-Based Approach in Scene Text Detection. IEEE Trans. Multim. 25: 6983-6996 (2023)
-
Zilong Fu, Hongtao Xie, Shancheng Fang, Yuxin Wang, Mengting Xing, Yongdong Zhang: Learning Pixel Affinity Pyramid for Arbitrary-Shaped Text Detection. ACM Trans. Multim. Comput. Commun. Appl. 19(1s): 29:1-29:24 (2023)
-
Zhihua Shang, Hongtao Xie, Lingyun Yu, Zhengjun Zha, Yongdong Zhang: Constructing Spatio-Temporal Graphs for Face Forgery Detection. ACM Trans. Web 17(3): 23:1-23:25 (2023)
-
Jingyuan Xu, Hongtao Xie, Qingfeng Tan, Hai Wu, Chuanbin Liu, Sicheng Zhang, Zhendong Mao, Yongdong Zhang: Multi-task hourglass network for online automatic diagnosis of developmental dysplasia of the hip. World Wide Web (WWW) 26(2): 539-559 (2023)
-
Yadong Qu, Qingfeng Tan, Hongtao Xie, Jianjun Xu, YuXin Wang, Yongdong Zhang: Exploring Stroke-Level Modifications for Scene Text Editing. AAAI 2023: 2119-2127
-
Sun'ao Liu, Yiheng Zhang, Zhaofan Qiu, Hongtao Xie, Yongdong Zhang, Ting Yao: Learning Orthogonal Prototypes for Generalized Few-Shot Semantic Segmentation. CVPR 2023: 11319-11328
-
Pandeng Li, Chen-Wei Xie, Liming Zhao, Hongtao Xie, Jiannan Ge, Yun Zheng, Deli Zhao, Yongdong Zhang: Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval. ICCV 2023: 4077-4087
-
Boqiang Zhang, Hongtao Xie, Yuxin Wang, Jianjun Xu, Yongdong Zhang: Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition. IJCAI 2023: 1704-1712
-
Zixiao Wang, Hongtao Xie, Yuxin Wang, Jianjun Xu, Boqiang Zhang, Yongdong Zhang: Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition. ACM Multimedia 2023: 509-518
-
Sun'ao Liu, Yiheng Zhang, Zhaofan Qiu, Hongtao Xie, Yongdong Zhang, Ting Yao: CARIS: Context-Aware Referring Image Segmentation. ACM Multimedia 2023: 779-788
-
Mingqi Fang, Lingyun Yu, Hongtao Xie, Junqiang Wu, Zezheng Wang, Jiahong Li, Yongdong Zhang: RAIRNet: Region-Aware Identity Rectification for Face Forgery Detection. ACM Multimedia 2023: 1455-1464
-
Keran Wang, Hongtao Xie, Yuxin Wang, Dongming Zhang, Yadong Qu, Zuan Gao, Yongdong Zhang: Masked Text Modeling: A Self-Supervised Pre-training Method for Scene Text Detection. ACM Multimedia 2023: 2006-2015
-
Wanting Yin, Hongtao Xie, Lei Zhang, Jiannan Ge, Pandeng Li, Chuanbin Liu, Yongdong Zhang: Frequency-based Zero-Shot Learning with Phase Augmentation. ACM Multimedia 2023: 3181-3189
-
Fengyuan Liu, Lingyun Yu, Hongtao Xie, Chuanbin Liu, Zhiguo Ding, Quanwei Yang, Yongdong Zhang: High Fidelity Face Swapping via Semantics Disentanglement and Structure Enhancement. ACM Multimedia 2023: 6907-6917
-
Yifan Gao, Jinpeng Lin, Min Zhou, Chuanbin Liu, Hongtao Xie, Tiezheng Ge, Yuning Jiang: TextPainter: Multimodal Text Image Generation with Visual-harmony and Text-comprehension for Poster Design. ACM Multimedia 2023: 7236-7246
-
Yan Jiang, Hongtao Xie, Lei Zhang, Pandeng Li, Dongming Zhang, Yongdong Zhang: Dual Dynamic Proxy Hashing Network for Long-tailed Image Retrieval. ACM Multimedia 2023: 8942-8953
-
Pandeng Li, Chen-Wei Xie, Hongtao Xie, Liming Zhao, Lei Zhang, Yun Zheng, Deli Zhao, Yongdong Zhang: MomentDiff: Generative Video Moment Retrieval from Random to Real. NeurIPS 2023
2022
-
Jiaqi Zhu, Feng Dai, Lingyun Yu, Hongtao Xie, Lidong Wang, Bo Wu, Yongdong Zhang: Attention-guided transformation-invariant attack for black-box adversarial examples. Int. J. Intell. Syst. 37(5): 3142-3165 (2022)
-
Yu Zhou, Hongtao Xie, Shancheng Fang, Yongdong Zhang: Semi-Supervised Text Detection With Accurate Pseudo-Labels. IEEE Signal Process. Lett. 29: 1272-1276 (2022)
-
Fanchao Lin, Hongtao Xie, Chuanbin Liu, Yongdong Zhang: Bilateral Temporal Re-Aggregation for Weakly-Supervised Video Object Segmentation. IEEE Trans. Circuits Syst. Video Technol. 32(7): 4498-4512 (2022)
-
Zheren Fu, Zhendong Mao, Chenggang Yan, An-An Liu, Hongtao Xie, Yongdong Zhang: Self-Supervised Synthesis Ranking for Deep Metric Learning. IEEE Trans. Circuits Syst. Video Technol. 32(7): 4736-4750 (2022)
-
Yuxin Wang, Hongtao Xie, Shancheng Fang, Mengting Xing, Jing Wang, Shenggao Zhu, Yongdong Zhang: PETR: Rethinking the Capability of Transformer-Based Language Model in Scene Text Recognition. IEEE Trans. Image Process. 31: 5585-5598 (2022)
-
Pandeng Li, Hongtao Xie, Shaobo Min, Jiannan Ge, Xun Chen, Yongdong Zhang: Deep Fourier Ranking Quantization for Semi-Supervised Image Retrieval. IEEE Trans. Image Process. 31: 5909-5922 (2022)
-
Ziheng Hu, Hongtao Xie, Lingyun Yu, Xingyu Gao, Zhihua Shang, Yongdong Zhang: Dynamic-Aware Federated Learning for Face Forgery Video Detection. ACM Trans. Intell. Syst. Technol. 13(4): 57:1-57:25 (2022)
-
Pandeng Li, Hongtao Xie, Shaobo Min, Zheng-Jun Zha, Yongdong Zhang: Online Residual Quantization Via Streaming Data Correlation Preserving. IEEE Trans. Multim. 24: 981-994 (2022)
-
Lingyun Yu, Hongtao Xie, Yongdong Zhang: Multimodal Learning for Temporally Coherent Talking Face Generation With Articulator Synergy. IEEE Trans. Multim. 24: 2950-2962 (2022)
-
Mengting Xing, Hongtao Xie, Qingfeng Tan, Shancheng Fang, Yuxin Wang, Zhengjun Zha, Yongdong Zhang: Boundary-Aware Arbitrary-Shaped Scene Text Detector With Learnable Embedding Network. IEEE Trans. Multim. 24: 3129-3143 (2022)
-
Pandeng Li, Yan Li, Hongtao Xie, Lei Zhang: Neighborhood-Adaptive Structure Augmented Metric Learning. AAAI 2022: 1367-1375
-
Sun'ao Liu, Hongtao Xie, Hai Xu, Yongdong Zhang, Qi Tian: Partial Class Activation Attention for Semantic Segmentation. CVPR 2022: 16815-16824
-
Pandeng Li, Hongtao Xie, Jiannan Ge, Lei Zhang, Shaobo Min, Yongdong Zhang: Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval. ECCV (14) 2022: 181-197
-
Yuxin Wang, Hongtao Xie, Mengting Xing, Jing Wang, Shenggao Zhu, Yongdong Zhang: Detecting Tampered Scene Text in the Wild. ECCV (28) 2022: 215-232
-
Yunyan Yan, Chuanbin Liu, Hongtao Xie, Sicheng Zhang, Zhendong Mao: Weakly Supervised Pediatric Bone Age Assessment Using Ultrasonic Images via Automatic Anatomical RoI Detection. ICMR 2022: 647-653
-
Quanwei Yang, Xinchen Liu, Wu Liu, Hongtao Xie, Xiaoyan Gu, Lingyun Yu, Yongdong Zhang: REMOT: A Region-to-Whole Framework for Realistic Human Motion Transfer. ACM Multimedia 2022: 1128-1137
-
Jiaming Li, Hongtao Xie, Lingyun Yu, Yongdong Zhang: Wavelet-enhanced Weakly Supervised Local Feature Learning for Face Forgery Detection. ACM Multimedia 2022: 1299-1308
-
Yunning Cao, Ye Ma, Min Zhou, Chuanbin Liu, Hongtao Xie, Tiezheng Ge, Yuning Jiang: Geometry Aligned Variational Transformer for Image-conditioned Layout Generation. ACM Multimedia 2022: 1561-1571
-
Jiannan Ge, Hongtao Xie, Shaobo Min, Pandeng Li, Yongdong Zhang: Dual Part Discovery Network for Zero-Shot Learning. ACM Multimedia 2022: 3244-3252
-
Jingyuan Xu, Hongtao Xie, Chuanbin Liu, Yongdong Zhang: Proxy Probing Decoder for Weakly Supervised Object Localization: A Baseline Investigation. ACM Multimedia 2022: 4185-4193
-
Jianjun Xu, Hongtao Xie, Hai Xu, Yuxin Wang, Sun'ao Liu, Yongdong Zhang: Boat in the Sky: Background Decoupling and Object-aware Pooling for Weakly Supervised Semantic Segmentation. ACM Multimedia 2022: 5783-5792
2021
-
An-An Liu, Heyu Zhou, Weizhi Nie, Zhenguang Liu, Wu Liu, Hongtao Xie, Zhendong Mao, Xuanya Li, Dan Song: Hierarchical multi-view context modelling for 3D object classification and retrieval. Inf. Sci. 547: 984-995 (2021)
-
Zhihua Shang, Hongtao Xie, Zhengjun Zha, Lingyun Yu, Yan Li, Yongdong Zhang: PRRNet: Pixel-Region relation network for face forgery detection. Pattern Recognit. 116: 107950 (2021)
-
Chuanbin Liu, Hongtao Xie, Yongdong Zhang: Self-Supervised Attention Mechanism for Pediatric Bone Age Assessment With Efficient Weak Annotation. IEEE Trans. Medical Imaging 40(10): 2685-2697 (2021)
-
Jingyuan Xu, Hongtao Xie, Chuanbin Liu, Fang Yang, Sicheng Zhang, Xun Chen, Yongdong Zhang: Hip Landmark Detection With Dependency Mining in Ultrasound Image. IEEE Trans. Medical Imaging 40(12): 3762-3774 (2021)
-
Shaobo Min, Xuejin Chen, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang: A Mutually Attentive Co-Training Framework for Semi-Supervised Recognition. IEEE Trans. Multim. 23: 899-910 (2021)
-
Yuxin Wang, Hongtao Xie, Zhengjun Zha, Youliang Tian, Zilong Fu, Yongdong Zhang: R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection. IEEE Trans. Multim. 23: 1316-1329 (2021)
-
Shaobo Min, Hantao Yao, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang: Domain-Oriented Semantic Embedding for Zero-Shot Learning. IEEE Trans. Multim. 23: 3919-3930 (2021)
-
Jiannan Ge, Hongtao Xie, Shaobo Min, Yongdong Zhang: Semantic-guided Reinforced Region Embedding for Generalized Zero-Shot Learning. AAAI 2021: 1406-1414
-
Fanchao Lin, Hongtao Xie, Yan Li, Yongdong Zhang: Query-Memory Re-Aggregation for Weakly-supervised Video Object Segmentation. AAAI 2021: 2038-2046
-
Jiaming Li, Hongtao Xie, Jiahong Li, Zhongyuan Wang, Yongdong Zhang: Frequency-Aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection. CVPR 2021: 6458-6467
-
Shancheng Fang, Hongtao Xie, Yuxin Wang, Zhendong Mao, Yongdong Zhang:
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition. CVPR 2021: 7098-7107
-
Yuxin Wang, Hongtao Xie, Shancheng Fang, Jing Wang, Shenggao Zhu, Yongdong Zhang:
From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network. ICCV 2021: 14174-14183
-
Lingfeng Ma, Chuanbin Liu, Sicheng Zhang, Yizhi Liu, Hongtao Xie:
Global Characteristic Guided Landmark Detection for Genu Valgus and Varus Diagnosis. ICIG (2) 2021: 523-534
-
Ziheng Hu, Hongtao Xie, Yuxin Wang, Jiahong Li, Zhongyuan Wang, Yongdong Zhang:
Dynamic Inconsistency-aware DeepFake Video Detection. IJCAI 2021: 736-742
-
Zilong Fu, Hongtao Xie, Guoqing Jin, Junbo Guo:
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition. ICMR 2021: 638-644
-
Jianjun Chen, Shancheng Fang, Hongtao Xie, Zheng-Jun Zha, Yue Hu, Jianlong Tan:
End-to-end Boundary Exploration for Weakly-supervised Semantic Segmentation. ACM Multimedia 2021: 2381-2390
-
Yu Zhou, Hongtao Xie, Shancheng Fang, Jing Wang, Zhengjun Zha, Yongdong Zhang:
TDI TextSpotter: Taking Data Imbalance into Account in Scene Text Spotting. ACM Multimedia 2021: 2510-2518
-
Bingyu Hu, Zheng-Jun Zha, Jiawei Liu, Xierong Zhu, Hongtao Xie:
Cluster and Scatter: A Multi-grained Active Semi-supervised Learning Framework for Scalable Person Re-identification. ACM Multimedia 2021: 2605-2614
2020
-
Xiaonan Guo, Hongtao Xie, Hai Xu, Yongdong Zhang:
Global context and boundary structure-guided network for cross-modal organ segmentation. Inf. Process. Manag. 57(4): 102252 (2020)
-
Shaobo Min, Hantao Yao, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang:
Multi-Objective Matrix Normalization for Fine-Grained Visual Recognition. IEEE Trans. Image Process. 29: 4996-5009 (2020)
-
Yu Zhang, Xingyu Gao, Zhenyu Chen, Huicai Zhong, Hongtao Xie, Chenggang Yan:
Mining Spatial-Temporal Similarity for Visual Tracking. IEEE Trans. Image Process. 29: 8107-8119 (2020)
-
Chuanbin Liu, Hongtao Xie, Sicheng Zhang, Zhendong Mao, Jun Sun, Yongdong Zhang:
Misshapen Pelvis Landmark Detection With Local-Global Feature Learning for Diagnosing Developmental Dysplasia of the Hip. IEEE Trans. Medical Imaging 39(12): 3944-3954 (2020)
-
Chuanbin Liu, Hongtao Xie, Zhengjun Zha, Lingyun Yu, Zhineng Chen, Yongdong Zhang:
Bidirectional Attention-Recognition Model for Fine-Grained Object Classification. IEEE Trans. Multim. 22(7): 1785-1795 (2020)
-
Zheng-Jun Zha, Chong Wang, Dong Liu, Hongtao Xie, Yongdong Zhang:
Robust Deep Co-Saliency Detection With Group Semantic and Pyramid Attention. IEEE Trans. Neural Networks Learn. Syst. 31(7): 2398-2408 (2020)
-
Chuanbin Liu, Hongtao Xie, Zheng-Jun Zha, Lingfeng Ma, Lingyun Yu, Yongdong Zhang:
Filtration and Distillation: Enhancing Region Attention for Fine-Grained Visual Categorization. AAAI 2020: 11555-11562
-
Hai Wu, Hongtao Xie, Chuanbin Liu, Zheng-Jun Zha, Jun Sun, Yongdong Zhang:
CircleNet for Hip Landmark Detection. AAAI 2020: 12370-12377
-
Benfeng Xu, Licheng Zhang, Zhendong Mao, Quan Wang, Hongtao Xie, Yongdong Zhang:
Curriculum Learning for Natural Language Understanding. ACL 2020: 6095-6104
-
Chunxiao Liu, Zhendong Mao, Tianzhu Zhang, Hongtao Xie, Bin Wang, Yongdong Zhang:
Graph Structured Network for Image-Text Matching. CVPR 2020: 10918-10927
-
Yuxin Wang, Hongtao Xie, Zheng-Jun Zha, Mengting Xing, Zilong Fu, Yongdong Zhang:
ContourNet: Taking a Further Step Toward Accurate Arbitrary-Shaped Scene Text Detection. CVPR 2020: 11750-11759
-
Shaobo Min, Hantao Yao, Hongtao Xie, Chaoqun Wang, Zheng-Jun Zha, Yongdong Zhang:
Domain-Aware Visual Bias Eliminating for Generalized Zero-Shot Learning. CVPR 2020: 12661-12670
-
Zixiao Wang, Hai Xu, Youliang Tian, Hongtao Xie:
Hierarchical Consistency and Refinement for Semi-supervised Medical Segmentation. ICPR Workshops (6) 2020: 267-276
-
Zhikun Huang, Zhedong Zheng, Chenggang Yan, Hongtao Xie, Yaoqi Sun, Jianzhong Wang, Jiyong Zhang:
Real-World Automatic Makeup via Identity Preservation Makeup Net. IJCAI 2020: 652-658
-
Chuanbin Liu, Hongtao Xie, Yunyan Yan, Zhendong Mao, Yongdong Zhang:
Learning Rich Attention for Pediatric Bone Age Assessment. MICCAI (1) 2020: 232-242
-
Yu Zhou, Hongtao Xie, Shancheng Fang, Yan Li, Yongdong Zhang:
CRNet: A Center-aware Representation for Detecting Text of Arbitrary Shapes. ACM Multimedia 2020: 2571-2580
-
Hai Xu, Hongtao Xie, Zheng-Jun Zha, Sun'ao Liu, Yongdong Zhang:
March on Data Imperfections: Domain Division and Domain Generalization for Semantic Segmentation. ACM Multimedia 2020: 3044-3053
-
Lixuan Meng, Chenggang Yan, Jun Li, Jian Yin, Wu Liu, Hongtao Xie, Liang Li:
Multi-Features Fusion and Decomposition for Age-Invariant Face Recognition. ACM Multimedia 2020: 3146-3154
-
Chuanbin Liu, Youliang Tian, Hongtao Xie:
Law Is Order: Protecting Multimedia Network Transmission by Game Theory and Mechanism Design. MMM (2) 2020: 651-668
-
Sun'ao Liu, Hai Xu, Yizhi Liu, Hongtao Xie:
Improving Brain Tumor Segmentation with Dilated Pseudo-3D Convolution and Multi-direction Fusion. MMM (1) 2020: 727-738
-
Shaobo Min, Hongtao Xie, Hantao Yao, Xuran Deng, Zheng-Jun Zha, Yongdong Zhang:
Hierarchical Granularity Transfer Learning. NeurIPS 2020
2019
-
Yanping Ma, Qiming Liu, Cuifeng Li, Yi Tang, Hongtao Xie:
Distributed data-dependent locality sensitive hashing. Int. J. High Perform. Comput. Netw. 13(3): 304-311 (2019)
-
Zhineng Chen, Wei Zhang, Bin Deng, Hongtao Xie, Xiaoyan Gu:
Name-face association with web facial image supervision. Multim. Syst. 25(1): 1-20 (2019)
-
Yanping Ma, Dongbao Yang, Hongtao Xie, Jian Yin:
Supervised deep hashing for image content security. Multim. Tools Appl. 78(1): 661-676 (2019)
-
Hongtao Xie, Dongbao Yang, Nannan Sun, Zhineng Chen, Yongdong Zhang:
Automated pulmonary nodule detection in CT images using deep convolutional neural networks. Pattern Recognit. 85: 109-119 (2019)
-
Hongtao Xie, Zhendong Mao, Yongdong Zhang, Han Deng, Chenggang Yan, Zhineng Chen:
Double-Bit Quantization and Index Hashing for Nearest Neighbor Search. IEEE Trans. Multim. 21(5): 1248-1260 (2019)
-
Hongtao Xie, Shancheng Fang, Zheng-Jun Zha, Yating Yang, Yan Li, Yongdong Zhang:
Convolutional Attention Networks for Scene Text Recognition. ACM Trans. Multim. Comput. Commun. Appl. 15(1s): 3:1-3:17 (2019)
-
Chong Wang, Zheng-Jun Zha, Dong Liu, Hongtao Xie:
Robust Deep Co-Saliency Detection with Group Semantic. AAAI 2019: 8917-8924
-
Shaobo Min, Xuejin Chen, Hongtao Xie, Zheng-Jun Zha, Guoqiang Bi, Feng Wu, Yongdong Zhang:
Accurate Segmentation of Synaptic Cleft with Contour Growing Concatenated with a Convnet. ICIP 2019: 1420-1424
-
Yu Zhou, Shancheng Fang, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang:
MLTS: A Multi-Language Scene Text Spotter. ICME 2019: 163-168
-
Fanchao Lin, Chuanbin Liu, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang:
Semantic-Embedding and Shape-Aware U-Net for Ultrasound Eyeball Segmentation. ICME 2019: 892-897
-
Shancheng Fang, Hongtao Xie, Jianjun Chen, Jianlong Tan, Yongdong Zhang:
Learning to Draw Text in Natural Images with Conditional Adversarial Networks. IJCAI 2019: 715-722
-
Yuxin Wang, Hongtao Xie, Zilong Fu, Yongdong Zhang:
DSRN: A Deep Scale Relationship Network for Scene Text Detection. IJCAI 2019: 947-953
-
Weijian Chen, Yulong Gu, Zhaochun Ren, Xiangnan He, Hongtao Xie, Tong Guo, Dawei Yin, Yongdong Zhang:
Semi-supervised User Profiling with Heterogeneous Graph Attention Networks. IJCAI 2019: 2116-2122
-
Hai Xu, Hongtao Xie, Yizhi Liu, Chuandong Cheng, Chaoshi Niu, Yongdong Zhang:
Deep Cascaded Attention Network for Multi-task Brain Tumor Segmentation. MICCAI (3) 2019: 420-428
-
Chuanbin Liu, Hongtao Xie, Sicheng Zhang, Jingyuan Xu, Jun Sun, Yongdong Zhang:
Misshapen Pelvis Landmark Detection by Spatial Local Correlation Mining for Diagnosing Developmental Dysplasia of the Hip. MICCAI (6) 2019: 441-449
-
Chuanbin Liu, Hongtao Xie, Yizhi Liu, Zheng-Jun Zha, Fanchao Lin, Yongdong Zhang:
Extract Bone Parts Without Human Prior: End-to-end Convolutional Neural Network for Pediatric Bone Age Assessment. MICCAI (6) 2019: 667-675
-
Yanhao Zhu, Zhineng Chen, Shuai Zhao, Hongtao Xie, Wenming Guo, Yongdong Zhang:
ACE-Net: Biomedical Image Segmentation with Augmented Contracting and Expansive Paths. MICCAI (1) 2019: 712-720
-
Tianhao Yang, Zheng-Jun Zha, Hongtao Xie, Meng Wang, Hanwang Zhang:
Question-Aware Tube-Switch Network for Video Question Answering. ACM Multimedia 2019: 1184-1192
-
Shaobo Min, Hantao Yao, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang:
Domain-Specific Embedding Network for Zero-Shot Recognition. ACM Multimedia 2019: 2070-2078
-
Shaobo Min, Hongtao Xie, Youliang Tian, Hantao Yao, Yongdong Zhang:
Adaptive Bilinear Pooling for Fine-grained Representation Learning. MMAsia 2019: 2:1-2:6
-
Hai Wu, Hongtao Xie, Fanchao Lin, Sicheng Zhang, Jun Sun, Yongdong Zhang:
WaveCSN: Cascade Segmentation Network for Hip Landmark Detection. MMAsia 2019: 18:1-18:6
-
Xierong Zhu, Jiawei Liu, Hongtao Xie, Zheng-Jun Zha:
Adaptive Alignment Network for Person Re-identification. MMM (2) 2019: 16-27
2018
-
Shancheng Fang, Hongtao Xie, Zhineng Chen, Yizhi Liu, Yan Li:
Uyghur Text Matching in Graphic Images for Biomedical Semantic Analysis. Neuroinformatics 16(3-4): 445-455 (2018)
-
Chenggang Yan, Hongtao Xie, Shun Liu, Jian Yin, Yongdong Zhang, Qionghai Dai:
Effective Uyghur Language Text Detection in Complex Background Images for Traffic Prompt Identification. IEEE Trans. Intell. Transp. Syst. 19(1): 220-229 (2018)
-
Chenggang Yan, Hongtao Xie, Dongbao Yang, Jian Yin, Yongdong Zhang, Qionghai Dai:
Supervised Hash Coding With Deep Neural Network for Environment Perception of Intelligent Vehicles. IEEE Trans. Intell. Transp. Syst. 19(1): 284-295 (2018)
-
Chenggang Yan, Hongtao Xie, Jianjun Chen, Zheng-Jun Zha, Xinhong Hao, Yongdong Zhang, Qionghai Dai:
A Fast Uyghur Text Detector for Complex Background Images. IEEE Trans. Multim. 20(12): 3389-3398 (2018)
-
Nannan Sun, Dongbao Yang, Shancheng Fang, Hongtao Xie:
Deep Convolutional Nets for Pulmonary Nodule Detection and Classification. KSEM (2) 2018: 197-208
-
Shancheng Fang, Hongtao Xie, Zheng-Jun Zha, Nannan Sun, Jianlong Tan, Yongdong Zhang:
Attention and Language Ensemble for Scene Text Recognition with Convolutional Sequence Modeling. ACM Multimedia 2018: 248-256
-
Jiawei Liu, Zheng-Jun Zha, Hongtao Xie, Zhiwei Xiong, Yongdong Zhang:
CA3Net: Contextual-Attentional Attribute-Appearance Network for Person Re-Identification. ACM Multimedia 2018: 737-745
-
Jianjun Chen, Hongtao Xie, Yue Hu, Chenggang Yan:
Uyghur Text Localization with Fast Component Detection. MMM (1) 2018: 565-577
-
Di Chen, Zheng-Jun Zha, Jiawei Liu, Hongtao Xie, Yongdong Zhang:
Temporal-Contextual Attention Network for Video-Based Person Re-identification. PCM (1) 2018: 146-157
-
Zhihua Shang, Zilong Fu, Chuanbin Liu, Hongtao Xie, Yongdong Zhang:
Potential of Attention Mechanism for Classification of Optical Coherence Tomography Images. VCIP 2018: 1-4
- 张勇东,刘传彬,谢洪涛,李岩. 弱监督细粒度物体分类方法. 申请号:201910019867.4。
- 张勇东,林凡超,谢洪涛. 超声图像中眼球区域分割方法. 申请号:201910238410.2。
- 张勇东,徐海,谢洪涛. 脑胶质瘤区域自动分割方法. 申请号:201910567803.8。
- 张勇东,刘传彬,谢洪涛. 骨骼年龄评估方法. 申请号:201910568724.9。
- 张勇东,闵少波,谢洪涛,李岩. 细粒度图像零样本识别方法. 申请号:201910032246.X。
- 张勇东,尚志华,谢洪涛,李岩. 利用少数标注图像生成分类器的方法. 申请号:201910235392.2。
- 张勇东,周宇,谢洪涛,李岩. 多语言文本检测识别系统. 申请号:201910232853.0。
- 张勇东,闵少波,谢洪涛. 图像中目标分割的方法. 申请号:201811478643.1。
- 张勇东,徐静远,武海,谢洪涛. 髋关节X光图像快速自动分析方法. 申请号:201811389123.3。
- 张勇东,尚志华,武海,谢洪涛. 髋关节X光图像快速自动分析方法. 申请号:201811421819.5。
- 张勇东,刘传彬,武海,谢洪涛. 髋关节X光图像快速自动分析方法. 申请号:201811421818.5。
- 张勇东,王裕鑫,谢洪涛. 一种基于堆叠式全卷积神经网络的肺部血管分割方法. 申请号:201811384307.0。
- 张勇东,闵少波,谢洪涛. 细粒度图像分类方法. 申请号:201811210182.X。
- 张勇东,符子龙,尚志华,谢洪涛. 基于深度学习的视网膜OCT图像分类方法. 申请号:201811103949.9。
- 谢洪涛,张勇东. 一种基于卷积注意力网络的自然场景文本识别方法. 申请号:201810437763.0。
- 谢洪涛,张勇东. 一种快速的复杂背景图像中维语文字定位方法. 申请号:201810375055.9。
- 谢洪涛,张勇东. 基于2D卷积神经网络的肺结节检测方法. 申请号:201810496332.1。
- 张勇东、颜成钢、谢洪涛、唐金辉、唐胜 . 互联网视频流的高通量计算理论与方法 . 2019年国家自然科学奖二等奖。
- 张勇东、尚志华、谢洪涛、李岩. 利用少数标注图像生成分类器的方法. 2021年中国专利奖优秀奖。
- 张勇东、颜成钢、谢洪涛、唐金辉、唐胜 . 互联网视频的高效流式计算理论与方法 . 2018年中国电子学会科学技术奖一等奖(自然科学类)。
- 谢洪涛. 2022年度中国图象图形学学会青年科学家奖。
Top