
Your search:

Scholar name: 孔德慧 (Kong, Dehui)

Joint multi-scale transformers and pose equivalence constraints for 3D human pose estimation SCIE
Journal article | 2024, 103 | JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION

Abstract:

Different from image-based 3D pose estimation, video-based 3D pose estimation gains performance improvements from temporal information. However, these methods still face the challenge of insufficient generalization across human motion speed, body shape, and camera distance. To address these problems, we propose a novel approach, referred to as joint Spatial-temporal Multi-scale Transformers and Pose Transformation Equivalence Constraints (SMT-PTEC), for 3D human pose estimation from videos. We design a more general spatial-temporal multi-scale feature extraction strategy and introduce optimization constraints that adapt to the diversity of data to improve the accuracy of pose estimation. Specifically, we first introduce a spatial multi-scale transformer to extract multi-scale features of pose and establish a cross-scale information transfer mechanism, which effectively explores the underlying knowledge of human motion. Then, we present a temporal multi-scale transformer to explore multi-scale dependencies between frames, enhance the adaptability of the network to human motion speed, and improve the estimation accuracy through a context-aware fusion of multi-scale predictions. Moreover, we add pose transformation equivalence constraints by changing the training samples with horizontal flipping, scaling, and body shape transformation, which effectively overcomes the influence of camera distance and body shape on prediction accuracy. Extensive experimental results demonstrate that our approach achieves superior performance with less computational complexity than previous state-of-the-art methods. Code is available at https://github.com/JNGao123/SMT-PTEC.
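
The pose transformation equivalence constraints described above can be expressed as consistency losses: the prediction for a transformed input clip should equal the correspondingly transformed prediction for the original clip. Below is a minimal PyTorch sketch of such a constraint for horizontal flipping and scaling; the model interface, joint-pair list, and equal loss weighting are illustrative assumptions, not the released SMT-PTEC code.

import torch
import torch.nn.functional as F

def flip_pose(pose, left_right_pairs):
    # Mirror a (..., J, 2-or-3) pose about the x-axis and swap left/right joints.
    sign = pose.new_ones(pose.shape[-1])
    sign[0] = -1.0
    perm = list(range(pose.shape[-2]))
    for l, r in left_right_pairs:          # assumed joint index pairs, e.g. [(1, 4), (2, 5), ...]
        perm[l], perm[r] = perm[r], perm[l]
    return (pose * sign)[..., perm, :]

def equivalence_loss(model, pose_2d_seq, left_right_pairs, scale=1.2):
    pred = model(pose_2d_seq)                                   # (B, J, 3) root-relative 3D pose
    # Flip equivalence: predicting from a mirrored clip should mirror the prediction.
    pred_flip = model(flip_pose(pose_2d_seq, left_right_pairs))
    loss_flip = F.mse_loss(pred_flip, flip_pose(pred, left_right_pairs))
    # Scale equivalence: rescaling the 2D input (a camera-distance change) should not
    # change the recovered root-relative 3D pose.
    loss_scale = F.mse_loss(model(pose_2d_seq * scale), pred)
    return loss_flip + loss_scale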

Keywords:

Spatial-temporal multi-scale; Transformer; Pose transformation equivalence; Pose estimation

Citations:

GB/T 7714 Wu, Yongpeng , Kong, Dehui , Gao, Junna et al. Joint multi-scale transformers and pose equivalence constraints for 3D human pose estimation [J]. | JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION , 2024 , 103 .
MLA Wu, Yongpeng et al. "Joint multi-scale transformers and pose equivalence constraints for 3D human pose estimation" . | JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION 103 (2024) .
APA Wu, Yongpeng , Kong, Dehui , Gao, Junna , Li, Jinghua , Yin, Baocai . Joint multi-scale transformers and pose equivalence constraints for 3D human pose estimation . | JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION , 2024 , 103 .
OASNet: Object Affordance State Recognition Network With Joint Visual Features and Relational Semantic Embeddings SCIE
Journal article | 2024, 34 (5), 3368-3382 | IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

Abstract:

Traditional affordance learning tasks aim to understand an object's interactive functions in an image, such as affordance recognition and affordance detection. However, these tasks cannot determine whether the object is currently interacting, which is crucial for many follow-up tasks, including robotic manipulation and planning. To fill this gap, this paper proposes a novel object affordance state (OAS) recognition task, i.e., simultaneously recognizing an object's affordances and the partner objects that are interacting with it. Accordingly, to facilitate the application of deep learning technology, an OAS recognition dataset, OAS10k, is constructed by collecting and labeling over 10k images. In the dataset, a sample is defined as an image together with its OAS labels, and each label is represented as a triplet ⟨subject, subject's affordance, interacted object⟩. These triplet labels carry rich relational semantic information, which can improve OAS recognition performance. We hence construct a directed OAS knowledge graph of affordance states and extract an OAS matrix from it to model the semantic relationships of the triplets. Based on this matrix, we propose an OAS recognition network (OASNet), which utilizes a GCN to capture the relational semantic embeddings and a transformer to fuse them with the visual features of an image so as to recognize the affordance states of objects in the image. Experimental results on the OAS10k dataset and other triplet-label recognition datasets demonstrate that the proposed OASNet achieves the best performance compared with state-of-the-art methods. The dataset and code will be released at https://github.com/mxmdpc/OAS.
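
The two ingredients named in this abstract, a GCN over the OAS relation matrix that produces relational semantic embeddings and a transformer that fuses them with image features, can be sketched in a few lines of PyTorch. The feature sizes, the toy relation matrix, and the single cross-attention fusion layer below are illustrative assumptions, not the OASNet release.

import torch
import torch.nn as nn

class GCNLayer(nn.Module):
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.proj = nn.Linear(in_dim, out_dim)

    def forward(self, x, adj):
        # Symmetrically normalised propagation: D^-1/2 (A + I) D^-1/2 X W
        a_hat = adj + torch.eye(adj.size(0), device=adj.device)
        d = a_hat.sum(-1).clamp(min=1e-6).pow(-0.5)
        return torch.relu(self.proj((d[:, None] * a_hat * d[None, :]) @ x))

class OASHead(nn.Module):
    def __init__(self, label_dim=300, vis_dim=512):
        super().__init__()
        self.gcn1 = GCNLayer(label_dim, vis_dim)
        self.gcn2 = GCNLayer(vis_dim, vis_dim)
        self.fuse = nn.MultiheadAttention(vis_dim, num_heads=8, batch_first=True)
        self.cls = nn.Linear(vis_dim, 1)

    def forward(self, vis_feats, label_embed, oas_matrix):
        # vis_feats: (B, N, vis_dim) patch features; label_embed: (L, label_dim); oas_matrix: (L, L)
        sem = self.gcn2(self.gcn1(label_embed, oas_matrix), oas_matrix)      # relational embeddings
        sem = sem.unsqueeze(0).expand(vis_feats.size(0), -1, -1)
        fused, _ = self.fuse(query=sem, key=vis_feats, value=vis_feats)      # labels attend to the image
        return self.cls(fused).squeeze(-1)                                   # (B, L) multi-label logits

# Toy usage: 16 OAS labels with 300-d initial embeddings, 49 image patches.
head = OASHead()
logits = head(torch.randn(2, 49, 512), torch.randn(16, 300), torch.rand(16, 16))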

Keywords:

Object affordance state recognition; transformer; multi-label image classification; relational semantic embeddings; graph convolution networks

Citations:

GB/T 7714 Chen, Dongpan , Kong, Dehui , Li, Jinghua et al. OASNet: Object Affordance State Recognition Network With Joint Visual Features and Relational Semantic Embeddings [J]. | IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY , 2024 , 34 (5) : 3368-3382 .
MLA Chen, Dongpan et al. "OASNet: Object Affordance State Recognition Network With Joint Visual Features and Relational Semantic Embeddings" . | IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 34 . 5 (2024) : 3368-3382 .
APA Chen, Dongpan , Kong, Dehui , Li, Jinghua , Wang, Lichun , Gao, Junna , Yin, Baocai . OASNet: Object Affordance State Recognition Network With Joint Visual Features and Relational Semantic Embeddings . | IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY , 2024 , 34 (5) , 3368-3382 .
A multi-view weighted-aggregation 3D point cloud reconstruction method incoPat
Patent | 2023-02-24 | CN202310195559.3

Abstract:

This invention discloses a multi-view weighted-aggregation 3D point cloud reconstruction method. A non-local feature extractor processes the input images to obtain feature maps; homography transformations warp the feature maps to generate multiple cost volumes; a lightweight weighted aggregation module encodes the 3D relations among the cost volumes into a single 3D cost volume; an edge-semantics-guided pseudo-3D convolutional regression network performs depth regression on the 3D cost volume to obtain multi-view depth maps; and finally the point cloud is computed by back-projection using the camera matrix parameters. The dilated-convolution-based non-local feature extractor improves the completeness of the point cloud; the lightweight weighted aggregation module improves the accuracy of the point cloud while reducing the network's computational cost; and the edge-semantics-guided pseudo-3D convolutional network improves the accuracy of multi-view 3D reconstruction and lowers the hardware requirements.
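
The weighted aggregation step described here, reducing per-view cost volumes to a single 3D cost volume with learned per-voxel view weights, can be sketched as follows. The similarity measure, the tiny weight network, and all tensor shapes are illustrative assumptions; the non-local feature extraction, homography warping, and depth-regression stages of the patent are omitted.

import torch
import torch.nn as nn

class WeightedCostAggregation(nn.Module):
    def __init__(self, feat_ch=32):
        super().__init__()
        # A small network scores how reliable each source view is at each voxel.
        self.weight_net = nn.Sequential(
            nn.Conv3d(feat_ch, 8, kernel_size=1), nn.ReLU(inplace=True),
            nn.Conv3d(8, 1, kernel_size=1))

    def forward(self, ref_volume, src_volumes):
        # ref_volume: (B, C, D, H, W); src_volumes: warped source-view volumes of the same shape.
        sims = [ref_volume * src for src in src_volumes]            # per-view similarity volumes
        scores = torch.stack([self.weight_net(s) for s in sims], dim=1)
        weights = torch.softmax(scores, dim=1)                      # normalise weights over the views
        return (torch.stack(sims, dim=1) * weights).sum(dim=1)      # aggregated (B, C, D, H, W) cost

agg = WeightedCostAggregation()
ref = torch.randn(1, 32, 8, 16, 16)
cost = agg(ref, [torch.randn_like(ref) for _ in range(3)])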

Citations:

GB/T 7714 孔德慧 , 张少杰 , 李敬华 et al. 一种多视角加权聚合的三维点云重建方法 : CN202310195559.3[P]. | 2023-02-24 .
MLA 孔德慧 et al. "一种多视角加权聚合的三维点云重建方法" : CN202310195559.3. | 2023-02-24 .
APA 孔德慧 , 张少杰 , 李敬华 , 尹宝才 . 一种多视角加权聚合的三维点云重建方法 : CN202310195559.3. | 2023-02-24 .
A hypergraph-attention-based human mesh reconstruction method incoPat
Patent | 2023-05-25 | CN202310600839.8

Abstract:

This invention discloses a hypergraph-attention-based human mesh reconstruction method. A hypergraph-based hierarchical representation of the human mesh is proposed to form a human mesh representation model with part semantics, and this new representation provides the structural basis for human mesh reconstruction. A Body2Parts feature transfer module aggregates features across body parts and fuses them with image information, performing interaction and fusion at the part level to support high-quality part-level human reconstruction. A Part2Vertices feature transfer module transfers part features to vertex features and uses hypergraph attention to refine vertex-level features; with vertices as the representation unit, features are propagated within each part to support fine-grained, mesh-vertex-level human reconstruction. Through this hierarchical reconstruction method built on the hierarchical human mesh representation model, the invention achieves a high-performance trade-off between 3D human mesh reconstruction accuracy and computational cost.
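
The Part2Vertices transfer described above, pushing part-level features down to the vertices each part covers and then refining vertex features with attention, can be sketched with an ordinary part-vertex incidence matrix. The incidence construction, feature sizes, and the masked multi-head attention used in place of a true hypergraph attention operator are illustrative assumptions, not the patented formulation.

import torch
import torch.nn as nn
import torch.nn.functional as F

class Part2Vertices(nn.Module):
    def __init__(self, dim=256, heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, part_feats, vert_feats, incidence):
        # part_feats: (B, P, C); vert_feats: (B, V, C)
        # incidence: (V, P) with incidence[v, p] = 1 if vertex v belongs to part p
        vert_feats = vert_feats + incidence @ part_feats        # transfer part features to their vertices
        same_part = (incidence @ incidence.T) > 0               # (V, V): vertices sharing a part
        refined, _ = self.attn(vert_feats, vert_feats, vert_feats,
                               attn_mask=~same_part)            # attend only within a part
        return self.norm(vert_feats + refined)

part_ids = torch.randint(0, 24, (431,))                         # toy: 431 mesh vertices, 24 parts
incidence = F.one_hot(part_ids, 24).float()
module = Part2Vertices()
out = module(torch.randn(2, 24, 256), torch.randn(2, 431, 256), incidence)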

Citations:

GB/T 7714 孔德慧 , 郝晨辉 , 李敬华 et al. 一种基于超图注意力的人体网格重建方法 : CN202310600839.8[P]. | 2023-05-25 .
MLA 孔德慧 et al. "一种基于超图注意力的人体网格重建方法" : CN202310600839.8. | 2023-05-25 .
APA 孔德慧 , 郝晨辉 , 李敬华 , 尹宝才 . 一种基于超图注意力的人体网格重建方法 : CN202310600839.8. | 2023-05-25 .
A hierarchical human-object interaction detection method based on human interaction intention information incoPat
Patent | 2023-03-20 | CN202310266335.7

Abstract:

This invention discloses a hierarchical human-object interaction detection method based on human interaction intention information, comprising 1) object detection, which detects all object instances in the input image, and 2) human-object interaction detection, which performs interaction detection over all instance pairs in the image. Human gaze information is abstracted through the design of visual features to model the context regions attended to by the interaction participants; a human pose graph oriented toward interaction intention is constructed to exploit the discriminative information that body motion provides for interaction detection; and the distance between the human and the object is used as a feature to guide the refinement of the visual distance features, improving the performance of the human-object interaction detection algorithm.
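
The distance-guided step mentioned above, using the human-object distance as a feature that steers the pairwise visual features before interaction classification, can be sketched as a simple gating head. The gating design, the box-centre distance encoding, and the feature sizes are illustrative assumptions, not the patented pipeline (the gaze and pose-graph branches are omitted).

import torch
import torch.nn as nn

class DistanceGuidedFusion(nn.Module):
    def __init__(self, vis_dim=512, num_actions=29):
        super().__init__()
        self.dist_enc = nn.Sequential(nn.Linear(2, 64), nn.ReLU(), nn.Linear(64, vis_dim))
        self.cls = nn.Linear(vis_dim, num_actions)

    def forward(self, pair_feats, human_boxes, object_boxes):
        # pair_feats: (N, vis_dim) appearance feature of each human-object pair
        # *_boxes: (N, 4) boxes in (x1, y1, x2, y2); the centre offset guides the gate
        h_ctr = (human_boxes[:, :2] + human_boxes[:, 2:]) / 2
        o_ctr = (object_boxes[:, :2] + object_boxes[:, 2:]) / 2
        gate = torch.sigmoid(self.dist_enc(h_ctr - o_ctr))       # (N, vis_dim) values in (0, 1)
        return self.cls(pair_feats * gate)                       # interaction logits per pair

fusion = DistanceGuidedFusion()
logits = fusion(torch.randn(5, 512), torch.rand(5, 4) * 100, torch.rand(5, 4) * 100)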

Citations:

GB/T 7714 孔德慧 , 王帅 , 李敬华 et al. 一种基于人体交互意图信息的层级人物交互检测方法 : CN202310266335.7[P]. | 2023-03-20 .
MLA 孔德慧 et al. "一种基于人体交互意图信息的层级人物交互检测方法" : CN202310266335.7. | 2023-03-20 .
APA 孔德慧 , 王帅 , 李敬华 , 尹宝才 . 一种基于人体交互意图信息的层级人物交互检测方法 : CN202310266335.7. | 2023-03-20 .
A hand-drawn sketch 3D model reconstruction method based on spatial skeleton information incoPat
Patent | 2023-02-24 | CN202310163381.4

Abstract:

This invention discloses a 3D model reconstruction method for hand-drawn sketches based on spatial skeleton information, comprising a spatial-skeleton-guided encoder, a domain-adaptive encoder, and a self-attention decoder. The spatial skeleton encoder extracts skeleton features from the sketch, and the skeleton information serves as prior knowledge that supplies the auxiliary information needed to reconstruct a complete 3D model; the domain-adaptive encoder transfers knowledge learned from synthetic sketches to hand-drawn sketches; and the attention-based decoder resolves ambiguity. The method improves the accuracy of 3D reconstruction from a single hand-drawn sketch. The self-attention mechanism allows the model to distinguish sketch inputs with highly similar contours. In contrast to techniques that perform domain adaptation with a discriminator and a gradient reversal layer, whose training objective amounts to minimizing the Jensen-Shannon divergence between two distributions and may therefore be discontinuous with respect to the generator parameters, the domain adaptation constraint function of this invention can be regarded as differentiable everywhere, so training is more stable.
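
The abstract contrasts adversarial domain adaptation (whose objective corresponds to a Jensen-Shannon divergence) with an everywhere-differentiable constraint, without spelling the latter out here. A common example of such a smooth alignment statistic is maximum mean discrepancy (MMD); the sketch below shows an MMD loss between synthetic-sketch and hand-drawn-sketch encoder features purely as an illustration of the idea, not as the patented constraint.

import torch

def mmd_loss(feat_src, feat_tgt, bandwidths=(1.0, 2.0, 4.0)):
    # Squared MMD between two feature batches under a sum of RBF kernels.
    def rbf(a, b):
        d2 = torch.cdist(a, b).pow(2)                 # pairwise squared distances
        return sum(torch.exp(-d2 / (2.0 * s ** 2)) for s in bandwidths)
    return rbf(feat_src, feat_src).mean() + rbf(feat_tgt, feat_tgt).mean() \
        - 2.0 * rbf(feat_src, feat_tgt).mean()

# Toy usage: align encoder features of synthetic sketches with those of hand-drawn sketches.
loss = mmd_loss(torch.randn(32, 256), torch.randn(32, 256))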

Citations:

GB/T 7714 孔德慧 , 马杨 , 李敬华 et al. 一种基于空间骨架信息的手绘草图三维模型重建方法 : CN202310163381.4[P]. | 2023-02-24 .
MLA 孔德慧 et al. "一种基于空间骨架信息的手绘草图三维模型重建方法" : CN202310163381.4. | 2023-02-24 .
APA 孔德慧 , 马杨 , 李敬华 , 尹宝才 . 一种基于空间骨架信息的手绘草图三维模型重建方法 : CN202310163381.4. | 2023-02-24 .
CIGNet: Category-and-Intrinsic-Geometry Guided Network for 3D coarse-to-fine reconstruction SCIE
Journal article | 2023, 554 | NEUROCOMPUTING
WoS Core Collection citations: 4

Abstract:

3D object reconstruction from arbitrary view intensity images is a challenging but meaningful research topic in computer vision. The main limitations of existing approaches are that they lack complete and efficient prior information and might not be able to deal with serious occlusion or partial observation of 3D objects, which may produce incomplete and unreliable reconstructions. To reconstruct structure and recover missing or unseen parts of objects, category prior and intrinsic geometry relation are particularly useful and necessary during the 3D reconstruction process. In this paper, we propose Category-and-Intrinsic-Geometry Guided Network (CIGNet) for 3D coarse-to-fine reconstruction from arbitrary view intensity images by leveraging category prior and intrinsic geometry relation. CIGNet combines a category prior guided reconstruction module with an intrinsic geometry relation guided refinement module. In the first reconstruction module, we leverage semantic class context by adding a supervision term over object categories to output coarse reconstructed results. In the second refinement module, we model the coarse 3D volumetric data as 2D slices and consider intrinsic geometry relations between them to design graph structures of coarse 3D volumes to finish the graph based refinement. CIGNet can accomplish high-quality 3D reconstruction tasks by exploring the intra-category characteristics of objects as well as the intrinsic geometry relations of each object, both of which serve as useful complements to the visual information of images, in a coarse-to-fine fashion. Extensive quantitative and qualitative experiments on a synthetic dataset ShapeNet and real-world datasets Pix3D, Statue Model Repository, and BlendedMVS indicate that CIGNet outperforms several state-of-the-art methods in terms of accuracy and detail recovery.
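
The refinement module described here, reading the coarse voxel volume as a stack of 2D slices, connecting the slices in a graph, and refining them with graph convolution, can be sketched as follows. The chain-shaped slice graph, the feature sizes, and the single propagation layer are illustrative assumptions, not the CIGNet implementation.

import torch
import torch.nn as nn

class SliceGraphRefiner(nn.Module):
    def __init__(self, res=32, hidden=128):
        super().__init__()
        self.encode = nn.Linear(res * res, hidden)   # per-slice feature
        self.gcn = nn.Linear(hidden, hidden)
        self.decode = nn.Linear(hidden, res * res)

    def forward(self, coarse_volume):
        # coarse_volume: (B, D, H, W) occupancy probabilities; slices are taken along depth D.
        b, d, h, w = coarse_volume.shape
        x = self.encode(coarse_volume.reshape(b, d, h * w))                   # (B, D, hidden)
        # Chain adjacency: each slice talks to its depth neighbours (and itself).
        adj = torch.eye(d) + torch.diag(torch.ones(d - 1), 1) + torch.diag(torch.ones(d - 1), -1)
        adj = (adj / adj.sum(-1, keepdim=True)).to(coarse_volume.device)      # row-normalised
        x = torch.relu(self.gcn(adj @ x))                                     # propagate between slices
        residual = self.decode(x).reshape(b, d, h, w)
        return torch.sigmoid(coarse_volume.logit(eps=1e-6) + residual)        # refined occupancy

refiner = SliceGraphRefiner()
refined = refiner(torch.rand(2, 32, 32, 32))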

Keywords:

Graph convolutional network; 3D reconstruction; Category prior; Geometry perception; 3D refinement

Citations:

GB/T 7714 Gao, Junna , Kong, Dehui , Wang, Shaofan et al. CIGNet: Category-and-Intrinsic-Geometry Guided Network for 3D coarse-to-fine reconstruction [J]. | NEUROCOMPUTING , 2023 , 554 .
MLA Gao, Junna et al. "CIGNet: Category-and-Intrinsic-Geometry Guided Network for 3D coarse-to-fine reconstruction" . | NEUROCOMPUTING 554 (2023) .
APA Gao, Junna , Kong, Dehui , Wang, Shaofan , Li, Jinghua , Yin, Baocai . CIGNet: Category-and-Intrinsic-Geometry Guided Network for 3D coarse-to-fine reconstruction . | NEUROCOMPUTING , 2023 , 554 .
ADOSMNet: a novel visual affordance detection network with object shape mask guided feature encoders SCIE
Journal article | 2023, 83 (11), 31629-31653 | MULTIMEDIA TOOLS AND APPLICATIONS

Abstract:

Visual affordance detection aims to understand the functional attributes of objects, which is crucial for robots performing interactive tasks. Most existing affordance detection methods mainly utilize global image features while failing to fully exploit the features of locally relevant objects in the image, which often leads to suboptimal detection accuracy under the interference of cluttered backgrounds and neighbouring objects. Numerous studies have shown that the accuracy of affordance detection largely depends on the quality of the extracted image features. In this paper, we propose a novel affordance detection network with object shape mask guided feature encoders. The masks act as an attention mechanism that forces the network to focus on the shape regions of target objects in the image, which facilitates obtaining high-quality features. Specifically, we first propose a shape mask guided encoder, which uses masks to effectively locate all target objects so as to extract more expressive features. Based on this encoder, we then propose a dual enhance feature aggregation module consisting of two branches: the first branch encodes the global features of the original image, while the second branch locates each locally relevant object and encodes its precise features. Aggregating these features enhances the feature representation of each object, further improving feature quality and suppressing interference. Quantitative and qualitative evaluations against state-of-the-art methods demonstrate that the proposed method achieves superior performance on the two commonly used affordance detection datasets.
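
The shape-mask guidance described above, using an object mask as spatial attention so the encoder concentrates on the object region, can be sketched in a few lines. The tiny convolutional backbone and the soft residual form of the gating are illustrative assumptions standing in for the paper's encoder.

import torch
import torch.nn as nn
import torch.nn.functional as F

class MaskGuidedEncoder(nn.Module):
    def __init__(self, in_ch=3, feat_ch=64):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(in_ch, feat_ch, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(feat_ch, feat_ch, 3, stride=2, padding=1), nn.ReLU(inplace=True))

    def forward(self, image, shape_mask):
        # image: (B, 3, H, W); shape_mask: (B, 1, H, W) with 1 inside the target object
        feats = self.backbone(image)                                       # (B, C, H/4, W/4)
        mask = F.interpolate(shape_mask, size=feats.shape[-2:], mode="nearest")
        return feats * (0.5 + 0.5 * mask)   # soft gating: keep some context, emphasise the object region

enc = MaskGuidedEncoder()
feats = enc(torch.randn(2, 3, 128, 128), (torch.rand(2, 1, 128, 128) > 0.5).float())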

Keywords:

Feature enhancement; Visual affordance detection; Feature representation; Object shape mask; Image segmentation

Citations:

GB/T 7714 Chen, Dongpan , Kong, Dehui , Li, Jinghua et al. ADOSMNet: a novel visual affordance detection network with object shape mask guided feature encoders [J]. | MULTIMEDIA TOOLS AND APPLICATIONS , 2023 , 83 (11) : 31629-31653 .
MLA Chen, Dongpan et al. "ADOSMNet: a novel visual affordance detection network with object shape mask guided feature encoders" . | MULTIMEDIA TOOLS AND APPLICATIONS 83 . 11 (2023) : 31629-31653 .
APA Chen, Dongpan , Kong, Dehui , Li, Jinghua , Wang, Shaofan , Yin, Baocai . ADOSMNet: a novel visual affordance detection network with object shape mask guided feature encoders . | MULTIMEDIA TOOLS AND APPLICATIONS , 2023 , 83 (11) , 31629-31653 .
A Survey of Visual Affordance Recognition Based on Deep Learning SCIE
Journal article | 2023, 9 (6), 1458-1476 | IEEE TRANSACTIONS ON BIG DATA
WoS Core Collection citations: 2

Abstract:

Visual affordance recognition is an important research topic in robotics, human-computer interaction, and other computer vision tasks. In recent years, deep learning-based affordance recognition methods have achieved remarkable performance. However, there is no unified and intensive survey of these methods up to now. Therefore, this article reviews and investigates existing deep learning-based affordance recognition methods from a comprehensive perspective, hoping to pursue greater acceleration in this research domain. Specifically, this article first classifies affordance recognition into five tasks, delves into the methodologies of each task, and explores their rationales and essential relations. Second, several representative affordance recognition datasets are investigated carefully. Third, based on these datasets, this article provides a comprehensive performance comparison and analysis of the current affordance recognition methods, reporting the results of different methods on the same datasets and the results of each method on different datasets. Finally, this article summarizes the progress of affordance recognition, outlines the existing difficulties and provides corresponding solutions, and discusses its future application trends.

Keywords:

function understanding; Big Data; Image segmentation; computer vision; deep learning models; Task analysis; Surveys; Visual affordance recognition; Feature extraction; robotics; convolutional neural network; Humanities; Affordances

Citations:

GB/T 7714 Chen, Dongpan , Kong, Dehui , Li, Jinghua et al. A Survey of Visual Affordance Recognition Based on Deep Learning [J]. | IEEE TRANSACTIONS ON BIG DATA , 2023 , 9 (6) : 1458-1476 .
MLA Chen, Dongpan et al. "A Survey of Visual Affordance Recognition Based on Deep Learning" . | IEEE TRANSACTIONS ON BIG DATA 9 . 6 (2023) : 1458-1476 .
APA Chen, Dongpan , Kong, Dehui , Li, Jinghua , Wang, Shaofan , Yin, Baocai . A Survey of Visual Affordance Recognition Based on Deep Learning . | IEEE TRANSACTIONS ON BIG DATA , 2023 , 9 (6) , 1458-1476 .
DASI: Learning Domain Adaptive Shape Impression for 3D Object Reconstruction SCIE
Journal article | 2023, 25, 5248-5262 | IEEE TRANSACTIONS ON MULTIMEDIA

Abstract:

Previous methods for 3D object reconstruction from 2D images suffer from two issues: a lack of in-depth exploration of the prior knowledge of 3D shapes, and difficulty in dealing with seriously occluded parts. Inspired by humans' perception of real-world objects, which consists of an overall impression (known as a shape impression) followed by enhanced cognition, we propose a deep network (denoted DASI) that learns a Domain Adaptive Shape Impression for 3D reconstruction from arbitrary-view images. DASI consists of two modules: a shape reconstruction module and a shape refinement module. The former reconstructs a coarse volume by learning a domain adaptive shape impression as an embedding for image-based reconstruction. We first leverage 3D objects to learn a shape impression associated with the prior knowledge of 3D objects. To obtain a consistent shape impression from 2D images, we regard the 3D shape and the 2D image as two different domains; by adapting the two domains, the shape impression learned from 3D objects is transferred to 2D images and guides image-based reconstruction. The latter module refines the objects by decomposing the whole 3D volume into local 3D patches and exploring their intrinsic geometry relationships. Quantitative and qualitative experimental results on two benchmark datasets demonstrate that DASI outperforms several state-of-the-art methods for 3D reconstruction from single-view and multi-view 2D images.
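
The shape-impression idea in this abstract, learning a latent impression of 3D shapes and pulling image embeddings toward it so image-based reconstruction can reuse the 3D prior, can be sketched with a voxel autoencoder plus an embedding-alignment term. The fully connected networks and the plain L2 alignment loss (used here in place of DASI's domain adaptation) are illustrative assumptions, not the published model.

import torch
import torch.nn as nn
import torch.nn.functional as F

voxel_enc = nn.Sequential(nn.Flatten(), nn.Linear(32 * 32 * 32, 512), nn.ReLU(), nn.Linear(512, 128))
voxel_dec = nn.Sequential(nn.Linear(128, 512), nn.ReLU(), nn.Linear(512, 32 * 32 * 32))
image_enc = nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, 512), nn.ReLU(), nn.Linear(512, 128))

def training_losses(voxels, images):
    # voxels: (B, 32, 32, 32) ground-truth shapes; images: (B, 3, 64, 64) rendered views of them
    z_shape = voxel_enc(voxels)                                     # shape impression from the 3D domain
    recon = voxel_dec(z_shape).view_as(voxels)
    loss_recon = F.binary_cross_entropy_with_logits(recon, voxels)  # learn the 3D shape prior
    z_image = image_enc(images)                                     # embedding from the 2D domain
    loss_align = F.mse_loss(z_image, z_shape.detach())              # pull image embeddings to the prior
    return loss_recon, loss_align

losses = training_losses((torch.rand(4, 32, 32, 32) > 0.5).float(), torch.randn(4, 3, 64, 64))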

Keywords:

deep learning; transformer; 3D refinement; domain adaptation; 3D reconstruction

Citations:

GB/T 7714 Gao, Junna , Kong, Dehui , Wang, Shaofan et al. DASI: Learning Domain Adaptive Shape Impression for 3D Object Reconstruction [J]. | IEEE TRANSACTIONS ON MULTIMEDIA , 2023 , 25 : 5248-5262 .
MLA Gao, Junna et al. "DASI: Learning Domain Adaptive Shape Impression for 3D Object Reconstruction" . | IEEE TRANSACTIONS ON MULTIMEDIA 25 (2023) : 5248-5262 .
APA Gao, Junna , Kong, Dehui , Wang, Shaofan , Li, Jinghua , Yin, Baocai . DASI: Learning Domain Adaptive Shape Impression for 3D Object Reconstruction . | IEEE TRANSACTIONS ON MULTIMEDIA , 2023 , 25 , 5248-5262 .