Details - 北京工业大学机构库

Query：

学者姓名：鲍长春

Refining：

Year

2024 (3)
2022 (1)
2021 (4)
2020 (14)
2019 (23)
2018 (26)
2017 (20)
2016 (26)
2015 (37)
2014 (35)
2013 (37)
2012 (23)
2011 (27)
2010 (22)
2009 (24)
2008 (35)
2007 (34)
2006 (25)
2005 (18)
2004 (18)
2003 (4)
2002 (5)
2001 (1)
1999 (3)
1998 (4)

Submit Unfold

Type

会议论文 (259)
期刊论文 (167)
专利 (43)

Submit Unfold

Indexed by

Scopus (190)
EI (182)
CSCD (121)
PKU (118)
CPCI-S (113)
CNKI (106)
CQVIP (85)
万方 (76)
incoPat (43)
SCIE (36)
PubMed (2)

Submit Unfold

Source

电子学报 (25)
信号处理 (14)
通信学报 (14)
Acta Electronica Sinica (13)
北京工业大学学报 (13)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (10)
Journal of Beijing University of Technology (9)
Journal on Communications (7)
SPEECH COMMUNICATION (7)
11th IEEE International Symposium on Signal Processing and Information Technology, ISSPIT 2011 (6)
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING (6)
IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) (6)
第十四届全国信号处理学术年会(CCSP-2009) (6)
10th IEEE International Conference on Signal Processing, Communications and Computing (IEEE ICSPCC) (5)
2004 International Symposium on Chinese Spoken Language Processing (5)
2012 11th International Conference on Signal Processing, ICSP 2012 (5)
2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2015 (5)
2nd Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2010 (5)
4th International Symposium on Chinese Spoken Language Processing (ISCSLP 2004) (5)
Asia-Pacific-Signal-and-Information-Processing-Association Annual Summit and Conference (APSIPA ASC) (5)
IEEE 11th International Conference on Signal Processing (ICSP) (5)
IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) (5)
第十二届全国信号处理学术年会（CCSP-2005） (5)
2008 9th International Conference on Signal Processing, ICSP 2008 (4)
2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019 (4)
9th International Conference on Signal Processing (4)
Annual Summit and Conference of the Asia-Pacific-Signal-and-Information-Processing-Association (APSIPA ASC) (4)
数据采集与处理 (4)
第十三届全国信号处理学术年会（CCSP-2007） (4)
第十六届全国信号处理学术年会及产业发展大会 (4)
计算机工程与应用 (4)
14th IEEE International Conference on Signal Processing (ICSP) (3)
14th IEEE International Conference on Signal Processing, ICSP 2018 (3)
2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014 (3)
5th IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2015 (3)
9th Annual Summit and Conference of the Asia-Pacific-Signal-and-Information-Processing-Association (APSIPA ASC) (3)
9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017 (3)
Annual Summit and Conference of Asia-Pacific-Signal-and-Information-Processing-Association (APSIPA) (3)
Annual Summit and Conference of the Asia-Pacific-Signal-and-Information-Processing-Association (APSIPA) (3)
IEEE China Summit & International Conference on Signal and Information Processing (3)
IEEE China Summit and International Conference on Signal and Information Processing, ChinaSIP 2015 (3)
IEEE International Conference on Signal Processing, Communications and Computing ICSPCC 2015 (3)
Journal of Electronics and Information Technology (3)
Journal of Tsinghua University (3)
电声技术 (3)
电子与信息学报 (3)
第九届全国人机语音通讯学术会议 (3)
10th International Symposium on Chinese Spoken Language Processing (ISCSLP) (2)
10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016 (2)
14th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2013) (2)
15th IEEE International Conference on Signal Processing (ICSP) (2)
16th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2015) (2)
16th International Workshop on Acoustic Signal Enhancement (IWAENC) (2)
16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 (2)
18th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2017) (2)
2009 International Conference on Wireless Communications and Signal Processing, WCSP 2009 (2)
2011 International Conference on Wireless Communications and Signal Processing, WCSP 2011 (2)
2012 35th International Conference on Telecommunications and Signal Processing, TSP 2012 (2)
2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 (2)
2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2013 (2)
2013 IEEE China Summit and International Conference on Signal and Information Processing, ChinaSIP 2013 (2)
2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2016 (2)
2019 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2019 (2)
21st IEEE International Workshop on Machine Learning for Signal Processing (MLSP) (2)
21st IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2011 (2)
2nd IEEE China Summit / International Conference on Signal and Information Processing (IEEE ChinaSIP) (2)
2nd IEEE China Summit and International Conference on Signal and Information Processing, IEEE ChinaSIP 2014 (2)
35th International Conference on Telecommunications and Signal Processing (TSP) (2)
4th IEEE International Conference on Computer and Communications, ICCC 2018 (2)
4th International Conference on Audio, Language and Image Processing, ICALIP 2014 (2)
5th International Conference on Audio, Language and Image Processing (ICALIP) (2)
5th International Conference on Audio, Language and Image Processing, ICALIP 2016 (2)
6th International Conference on Audio, Language and Image Processing, ICALIP 2018 (2)
7th IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2017 (2)
9th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2008) (2)
APPLIED ACOUSTICS (2)
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) (2)
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, APSIPA ASC 2011 (2)
CHINESE JOURNAL OF ELECTRONICS (2)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2)
IEEE International Conference on Acoustics, Speech, and Signal Processing (2)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2)
INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association (2)
International Conference on Audio, Language and Image Processing (2)
International Conference on Audio, Language and Image Processing (ICALIP) (2)
International Conference on Wireless Communications and Signal Processing (2)
Journal of Data Acquisition and Processing (2)
清华大学学报(自然科学版) (2)
清华大学学报（自然科学版） (2)
电子科学学刊 (2)
第九届全国信号处理学术年会（CCSP-99） (2)
第十三届全国人机语音通讯学术会议(NCMMSC2015) (2)
第十二届全国人机语音通讯学术会议(NCMMSC''2013) (2)
10th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 (1)
10th Asia-Pacific-Signal-and-Information-Processing-Association Annual Summit and Conference (APSIPA ASC) (1)
12th IEEE International Symposium on Signal Processing and Information Technolgy (ISSPIT) (1)
12th IEEE International Symposium on Signal Processing and Information Technology, ISSPIT 2012 (1)
13th IEEE International Conference on Signal Processing (ICSP) (1)
13th IEEE International Conference on Signal Processing, ICSP 2016 (1)
15th International Conference on Intelligent Computing, ICIC 2019 (1)
2004 7th International Conference on Signal Processing Proceedings (ICSP'04) (1)
2004 7th International Conference on Signal Processing Proceedings, ICSP (1)
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2006 (1)
2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (1)
2012 2nd IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2012 (1)
2012 4th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2012 (1)
2013 36th International Conference on Telecommunications and Signal Processing, TSP 2013 (1)
2013 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2013 (1)
2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014 (1)
2014 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2014 (1)
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019 (1)
2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020 (1)
31st IEEE International Conference on Acoustics, Speech and Signal Processing (1)
36th International Conference on Telecommunications and Signal Processing (TSP) (1)
3rd Global Congress on Intelligent Systems (GCIS) (1)
3rd International Conference on Wireless Communication and Sensor Networks (WCSN) (1)
40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (1)
40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 (1)
41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (1)
41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 (1)
44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (1)
44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 (1)
5th International Conference on Machine Vision (ICMV) - Algorithms, Pattern Recognition and Basic Technologies (1)
7th International Conference on Signal Processing (1)
8th Annual Conference of the International Speech Communication Association, Interspeech 2007 (1)
8th International Conference on Signal Processing (1)
8th International Conference on Signal Processing, ICSP 2006 (1)
9th European Conference on Speech Communication and Technology (1)
9th IEEE International Symposium on Signal Processing and Information Technology (1)
9th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP) (1)
APPLIED SCIENCES-BASEL (1)
APSIPA Transactions on Signal and Information Processing (1)
Asia-Pacific Signal and Information Processing Association 2009 Annual Summit and Conference, APSIPA ASC 2009 (1)
CHINA COMMUNICATIONS (1)
Chinese Journal of Scientific Instrument (1)
ELECTRONICS (1)
ELECTRONICS LETTERS (1)
IEEE 10TH International Conference on Signal Processing (1)
IEEE 10th International Conference on Signal Processing (1)
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (1)
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES (1)
Interspeech Conference 2007 (1)
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING (1)
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (1)
Journal of Information Hiding and Multimedia Signal Processing (1)
RADIOENGINEERING (1)
SENSORS (1)
北京生物医学工程 (1)
崔子豪 (1)
电气电子教学学报 (1)
电讯技术 (1)
第十一届全国信号处理学术年会（CCSP-2003） (1)
第十五届全国信号处理学术年会 (1)
计算机工程 (1)

Submit Unfold

Complex

First Author (71)
Reprint Author (32)
First Comm (192)
Reprint Comm (192)
CAS 2 (2)
CAS 3 (6)
CAS 4 (11)
JCR 1 (4)
JCR 2 (10)
JCR 3 (6)
JCR 4 (4)

Submit Unfold

Former Name

Bao, Changchun (134)
Bao, CC (7)
鲍长春 (162)
Chang-Chun, Bao (1)
Bao Chang-chun (1)
Bao, Chang-Chun (91)
Changchun, Bao (1)
Bao Changchun (3)
Bao, Chang-chun (55)
Bao, C.-C. (12)

Submit Unfold

Co-Author

Liu, Xin (32)
Jia, Mao-Shen (29)
贾懋珅 (35)
Bao, Feng (27)
Jia, Maoshen (25)
Bu, Bing (26)
Jia, Mao-shen (26)
Deng, Feng (22)
Wang, Xianyun (16)
刘鑫 (15)
李锐 (19)
Cheng, Rui (14)
Xia, Bing-Yin (10)
范睿 (19)
李如玮 (13)
李海婷 (14)
Xiang, Yang (12)
刘泽新 (15)
朱恒 (16)
夏丙寅 (10)
Li, Ru-Wei (10)
Sun, Jundai (9)
Cui, Zihao (8)
Liang, Yan (8)
Li, Rui (9)
Xia, Bingyin (9)
Xia, Bing-yin (9)
Zhang, Liyan (8)
Zhang, Xingtao (8)
Bai, Zhigang (7)
Chen, Hao (7)
Ritz, Christian (5)
李立雄 (13)
李靓 (7)
窦庚欣 (9)
马勇 (7)
Fan, Rui (6)
He, Qi (6)
Jia, M.-S. (6)
Li, Na (5)
Li, Xiao-Ming (5)
Rui, Rui (5)
Yan, Bofang (6)
Yuan, Jing (6)
徐昊 (5)
王贵平 (6)
Chen, Nan (4)
Dou, Hui-jing (4)
Dou, Hui-Jing (5)
Ma, Yong (4)
Wang, Dujuan (5)
Zhu, Heng (5)
Zhu, Rong (4)
张鹏 (5)
李晓明 (4)
白燕宁 (5)
窦慧晶 (4)
鲍枫 (4)
Hao, Yue (4)
He, Yu-Wen (2)
Huang, Qizheng (4)
Kleijn, W. Bastiaan (4)
Li, Jing (4)
Li, Ruwei (4)
Li, Ru-wei (3)
Wang, Wenbei (4)
Xu, Hao (4)
Yang, Yan (4)
Zha, Meng-fang (4)
Zhang, Jiaming (3)
Zhang, Peng (4)
Zhou, Xuan (2)
何玉文 (4)
周璇 (4)
白海钏 (4)
邓峰 (4)
陈悦 (4)
齐峰岩 (4)
Bai, Yan-Ning (3)
Bu, B. (3)
Dou, Geng-Xin (3)
Jia Maoshen (2)
Ji, Qiang (3)
Li, Hai-Ting (3)
Liu, Ze-Xin (3)
Li, Xiao-ming (3)
Qi, Feng-Yan (3)
Sha, Yong-Tao (3)
Wang, Qi (3)
Wang, Qing (3)
Wang, Song (3)
Wu, Yuxuan (2)
Xin, Jie (3)
Yang, Ziyu (3)
Zha, M.-F. (3)
Zhang, Dawei (3)
Zhang, Xing-Tao (1)
Zhou, Ling-song (2)
Zhou, Ling-Song (2)
Zhu, Jinru (2)
吴水才 (4)
曹龙涛 (4)
李红蕊 (3)
梁岩 (3)
樊昌信 (2)
步兵 (4)
王嵩 (2)
罗亚飞 (2)
芮瑞 (3)
贾龙涛 (3)
辛杰 (4)
郭莉莉 (2)
Bai, Haichuan (2)
Bai, Yanning (2)
Chen, Yue (2)
Gao, Shang (1)
Guo, Li-Li (2)
He, Yu-wen (2)
Jia, Long-Tao (2)
Li, Haiting (2)
Li, Hongrui (2)
Li, J (2)
Liu, Haojie (2)
Liu, Jia (2)
Lukasiak, Jason (2)
Luo, Ya-Fei (2)
Pan, Jian-hong (2)
Pan, Jian-Hong (2)
Sha, Yong-tao (2)
Song, Boxuan (2)
Sun, Zheng-yang (2)
Xue, Er-Juan (2)
Zhang, Da-Wei (1)
Zheng, Xiguang (1)
Zhou, Yao (2)
刘靖宇 (2)
卓力 (2)
吴宇轩 (4)
周岭松 (3)
唐繁荣 (4)
崔子豪 (2)
张丽燕 (2)
张兴涛 (2)
张家铭 (4)
朱蓉 (3)
杨毅 (2)
王都生 (1)
胡翔宇 (4)
陈仙红 (2)
陈国顺 (2)
Abhayapala, Thushara D. (1)
An, Shan (1)
Bai, YN (1)
Bao, F. (1)
Chen, G.-S. (1)
Christensen, Mads Graesboll (1)
Christensen, Mads Groesboll (1)
Deng, F. (1)
Deng, Shuhao (1)
Dou, Huijing (1)
Gao, Zhen-zhen (1)
Gao, Zhen-Zhen (1)
Grasboll Christensen, Mads (1)
Hui-Jing, Dou (1)
Kleijn, W.Bastiaan (1)
Liang, Y. (1)
Li, Hai-ting (1)
Li Jing (1)
Li, Lu (1)
Li Ruwei (1)
Li, R.-W. (1)
Liu, Hao-jie (1)
Liu, Hao-Jie (1)
Liu, Jing-Yu (1)
Liu, Mingkuan (1)
Liu, Van (1)
Liu, X. (1)
Liu, Y (1)
Liu, Zexin (1)
Liu, Ze-xin (1)
Liu, Zhangyu (1)
Liu, Zhang-Yu (1)
Li Xiao-ming (1)
Li, X.-M. (1)
Lukasiak, J (1)
Nielsen, Jesper Kjaer (1)
Nielsen, Jesper Kjar (1)
Nielsen, Jesper Kjoer (1)
Niu, J.-H. (1)
Niu, Ji-hua (1)
Niu, Ji-Hua (1)
Qi Fengyan (1)
Qi, Fengyan (1)
Qi, Feng-yan (1)
Qi, FY (1)
Qiu, Jianwei (1)
Qiu, Jian-Wei (1)
Ritz, C (1)
Ru-Wei, Li (1)
Sha, Y.-T. (1)
Sun Jundai (1)
Sun, Zheng-Yang (1)
Sun, Z.-Y. (1)
Tao, Liang (1)
Wang, GP (1)
Wang, Guiping (1)
Wang, Gui-Ping (1)
Wang, Jing (1)
Wang, Q. (1)
Wang, S. (1)
Wang Song (1)
Wang Wenbei (1)
Wang, Wen-Bei (1)
Wang, Xian-yun (1)
Wang, X.-Y. (1)
W.Bastiaan Kleijn (1)
Xia Bingyin (1)
Xia, Biny-Yin (1)
Xia, B.-Y. (1)
Xiong, Wenmeng (1)
Xue, Er-juan (1)
Yang, Y. (1)
Yang, Yi (1)
Yanning, Bai (1)
Zha, Meng-Fang (1)
Zhang, D.-M. (1)
Zhang, Dong-ming (1)
Zhang, Wen (1)
孙俊岱 (2)
孙正阳 (2)
孟宪波 (1)
张大威 (1)
朱娜娜 (1)
李娜 (1)
柳燕 (1)
沙永涛 (1)
牛继华 (1)
王文倍 (2)
王永会 (1)
电子学报 (1)
罗德雨 (1)
薛二娟 (1)
陈楠 (1)
陈浩 (1)
高珍珍 (1)
黄鹤 (1)

Submit Unfold

Language

English (267)
Chinese (201)
Other (1)

Submit

Clean All

Select All Sort by：

Default

Default
Title
Year
WOS Cited Count
Impact factor
Ascending
Descending

< Page ，Total 47 >

Harmonic-Aware Frequency and Time Attention for Automatic Piano Transcription SCIE

期刊论文 | 2024 , 32 , 3492-3506 | IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

Wang, Qi | Liu, Mingkuan | Bao, Changchun | Jia, Maoshen

Abstract&Keyword Cite

Abstract ：

Automatic music transcription (AMT) is to transcribe music audio into note symbol representations. Concurrent notes overlapping in the frequency and time domains still hinder the performance of polyphonic piano transcription in current studies. In this work, we develop an attention-based method for piano transcription, where we propose a harmonic-aware attention to capture the musical frequency structure, and a local time attention to model temporal dependencies. The harmonic-aware frequency attention not only emphasizes the relationship between the obvious harmonics, but also extracts the correlation in the residual non-harmonic component. The time attention mechanism is improved using the learnable attention range masks to model frame-wise short-term dependencies on different subtasks. Experiments on the MAESTRO dataset demonstrate that the proposed system achieves state-of-the-art transcription performance on both frame-wise and note-wise F1 metrics. Considering the influence of the piano pedals' dynamic behavior on note duration, a note duration modification method is also proposed. With a more accurate annotation of the offset on MAESTRO, the transcription performance is further improved.

Keyword ：

harmonic mask harmonic mask Piano transcription Piano transcription time attention time attention piano pedal piano pedal frequency attention frequency attention

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Wang, Qi , Liu, Mingkuan , Bao, Changchun et al. Harmonic-Aware Frequency and Time Attention for Automatic Piano Transcription [J]. \| IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING , 2024 , 32 : 3492-3506 .
MLA	Wang, Qi et al. "Harmonic-Aware Frequency and Time Attention for Automatic Piano Transcription" . \| IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 32 (2024) : 3492-3506 .
APA	Wang, Qi , Liu, Mingkuan , Bao, Changchun , Jia, Maoshen . Harmonic-Aware Frequency and Time Attention for Automatic Piano Transcription . \| IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING , 2024 , 32 , 3492-3506 .
Export to	NoteExpress RIS BibTex

Three-Dimensional Room Transfer Function Parameterization Based on Multiple Concentric Planar Circular Arrays SCIE

期刊论文 | 2024 , 32 , 4384-4398 | IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

Li, Lu | Jia, Maoshen | Bao, Changchun

Abstract&Keyword Cite

Abstract ：

This study proposes a three-dimensional room transfer function (RTF) parameterization method based on multiple concentric planar circular arrays, which exhibits robustness to variations in the positions of both the receiver and source. According to the harmonic solution to the wave equation, the RTFs between two spherical regions (sound source and receiver) in a room can be expressed as a weighted sum of spherical harmonics, whose weight coefficients serve as the RTF parameters, which can be estimated by placing multiple concentric planar circular arrays composed of monopole-source pairs (MSPs) and multiple concentric planar circular arrays composed of omnidirectional-microphone pairs (OMPs) in respective source and receiver regions. We use MSP arrays to generate required outgoing soundfields originating from a source region. We derive a method to use OMP arrays to estimate RTF parameters that are concealed within the captured soundfield, which can be employed to reconstruct the RTF from any point in the source region to any point in the receiver region. The accuracy of the RTF parameterization method is validated through simulation testing.

Keyword ：

Position measurement Position measurement parameterization parameterization planar arrays planar arrays Kernel Kernel Harmonic analysis Harmonic analysis Room transfer function Room transfer function Loudspeakers Loudspeakers Receivers Receivers Planar arrays Planar arrays Transfer functions Transfer functions

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Li, Lu , Jia, Maoshen , Bao, Changchun . Three-Dimensional Room Transfer Function Parameterization Based on Multiple Concentric Planar Circular Arrays [J]. \| IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING , 2024 , 32 : 4384-4398 .
MLA	Li, Lu et al. "Three-Dimensional Room Transfer Function Parameterization Based on Multiple Concentric Planar Circular Arrays" . \| IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 32 (2024) : 4384-4398 .
APA	Li, Lu , Jia, Maoshen , Bao, Changchun . Three-Dimensional Room Transfer Function Parameterization Based on Multiple Concentric Planar Circular Arrays . \| IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING , 2024 , 32 , 4384-4398 .
Export to	NoteExpress RIS BibTex

First-Order Relative Harmonic Coefficient-Based Time-Frequency Points Selection for Multi-Source DOA Estimation SCIE

期刊论文 | 2024 , 32 , 3200-3212 | IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

Tao, Liang | Jia, Maoshen | Bao, Changchun | Xiong, Wenmeng

Abstract&Keyword Cite

Abstract ：

As a research focus within the field of array signal processing, multi-source direction-of-arrival (DOA) estimation in enclosed environments has been paid much attention. Contaminated by reverberation, noise, and inter-source interference, DOA estimation become challenging. Hence it is essential to identify time-frequency (TF) points dominated by only one source to alleviate these issues. This paper proposes a TF point selection method for DOA estimation based on the first-order relative harmonic coefficient (RHC). This is first analyzed on the "point" level from two perspective, and we design an adaptive single-source dominant zone (SSDZ) detection method. Subsequently, the relationship between first- and zero-order RHC magnitudes of different types of TF points is explored, and we develop a simple but useful rule to further select TF points in the detected SSDZs. Finally, we adopt two-dimensional (2-D) kernel density estimation (KDE) and peak search to estimate the DOAs of sources after calculating the angles of the detected TF points. The effectiveness and robustness of the proposed method are verified and compared with the reference methods through experiments with both the simulated and real-world recordings.

Keyword ：

DOA estimation DOA estimation multiple sources multiple sources SSDZ SSDZ first-order RHC first-order RHC

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Tao, Liang , Jia, Maoshen , Bao, Changchun et al. First-Order Relative Harmonic Coefficient-Based Time-Frequency Points Selection for Multi-Source DOA Estimation [J]. \| IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING , 2024 , 32 : 3200-3212 .
MLA	Tao, Liang et al. "First-Order Relative Harmonic Coefficient-Based Time-Frequency Points Selection for Multi-Source DOA Estimation" . \| IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 32 (2024) : 3200-3212 .
APA	Tao, Liang , Jia, Maoshen , Bao, Changchun , Xiong, Wenmeng . First-Order Relative Harmonic Coefficient-Based Time-Frequency Points Selection for Multi-Source DOA Estimation . \| IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING , 2024 , 32 , 3200-3212 .
Export to	NoteExpress RIS BibTex

一种基于CTC多层损失的语音识别方法 incoPat

专利 | 2022-06-02 | CN202210619908.5

陈仙红 | 罗德雨 | 鲍长春

Abstract&Keyword Cite

Abstract ：

一种基于CTC多层损失的语音识别方法，属于模式识别、声学领域。该方法对语音识别网络不同层的输出进行规范，使不同层的输出尽量接近所需要的语音识别结果，从而提高语音识别的性能。该方法包括模型训练与模型测试两个阶段：在训练阶段，将预处理后的训练集输入所搭建的多层语音识别网络中，计算不同层的损失和不同层的权重，将不同层损失加权求和得到多层损失，循环计算损失，更新网络参数直至收敛；在测试阶段，将预处理后的测试集输入训练好的多层语音识别网络，输出识别结果。本发明仅仅改变CTC语音识别模型训练阶段的损失函数，并不改变CTC语音识别模型的结构及其语音识别的过程，以低复杂度、低开销的特点提高语音识别的准确率。

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	陈仙红 , 罗德雨 , 鲍长春 . 一种基于CTC多层损失的语音识别方法 : CN202210619908.5[P]. \| 2022-06-02 .
MLA	陈仙红 et al. "一种基于CTC多层损失的语音识别方法" : CN202210619908.5. \| 2022-06-02 .
APA	陈仙红 , 罗德雨 , 鲍长春 . 一种基于CTC多层损失的语音识别方法 : CN202210619908.5. \| 2022-06-02 .
Export to	NoteExpress RIS BibTex

基于广义合成分析和深度神经网络的自回归系数估计方法 CQVIP

期刊论文 | 2021 , 49 (1) , 29-39 | 崔子豪

崔子豪 | 鲍长春 | 电子学报

Abstract&Keyword Cite

Abstract ：

基于广义合成分析和深度神经网络的自回归系数估计方法

Keyword ：

AR系数 AR系数广义合成分析广义合成分析深度神经网络深度神经网络莱文逊-杜宾迭代解莱文逊-杜宾迭代解

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	崔子豪 , 鲍长春 , 电子学报 . 基于广义合成分析和深度神经网络的自回归系数估计方法 [J]. \| 崔子豪 , 2021 , 49 (1) : 29-39 .
MLA	崔子豪 et al. "基于广义合成分析和深度神经网络的自回归系数估计方法" . \| 崔子豪 49 . 1 (2021) : 29-39 .
APA	崔子豪 , 鲍长春 , 电子学报 . 基于广义合成分析和深度神经网络的自回归系数估计方法 . \| 崔子豪 , 2021 , 49 (1) , 29-39 .
Export to	NoteExpress RIS BibTex

Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points SCIE

期刊论文 | 2021 , 29 , 379-392 | IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

Jia, Maoshen | Wu, Yuxuan | Bao, Changchun | Ritz, Christian

WoS CC Cited Count： 17

Abstract&Keyword Cite

Abstract ：

In this article, the direction of arrival (DOA) estimation of multiple speech sources in reverberant environments is investigated based on the recording of a soundfield microphone. First, the recordings are analyzed in the time-frequency (T-F) domain to detect both "points" (single T-F points) and "regions" (multiple, adjacent T-F points) corresponding to a single source with low reverberation (known as low-reverberant-single-source (LRSS) points). Then, a LRSS point detection algorithm is proposed based on a joint dominance measure and instantaneous single-source point (SSP) identification. Following this, initial DOA estimates obtained for the detected LRSS points are analyzed using a Gaussian Mixture Model (GMM) derived by the Expectation-Maximization (EM) algorithm to cluster components into sources or outliers using a rule-based method. Finally, the DOA of each actual source is obtained from the estimated source components. Experiments on both simulated data and data recorded in an actual acoustic chamber demonstrate that the proposed algorithm exhibits improved performance for the DOA estimation in reverberant environments when compared to several existing approaches.

Keyword ：

LRSS point LRSS point Reverberation Reverberation Reflection Reflection reverberant environments reverberant environments Speech processing Speech processing DOA estimation DOA estimation Microphone arrays Microphone arrays Time-frequency analysis Time-frequency analysis Estimation Estimation Direction-of-arrival estimation Direction-of-arrival estimation

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Jia, Maoshen , Wu, Yuxuan , Bao, Changchun et al. Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points [J]. \| IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING , 2021 , 29 : 379-392 .
MLA	Jia, Maoshen et al. "Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points" . \| IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 29 (2021) : 379-392 .
APA	Jia, Maoshen , Wu, Yuxuan , Bao, Changchun , Ritz, Christian . Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points . \| IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING , 2021 , 29 , 379-392 .
Export to	NoteExpress RIS BibTex

基于广义合成分析和深度神经网络的自回归系数估计方法 CSCD

期刊论文 | 2021 , 49 (01) , 29-39 | 电子学报

崔子豪 | 鲍长春

CNKI Cited Count： 2

Abstract&Keyword Cite

Abstract ：

自回归(AR)模型是一类描述时序序列相关性的有效方法,经典的AR系数估计方法对残差信号做了简单的假设,在噪声干扰等复杂场景中难以准确估计AR系数,而基于深度神经网络(DNN)的AR(DNN-AR)系数估计方法在训练中容易受到莱文逊-杜宾迭代(LDR)解法的数值稳定性的影响.为改善DNN-AR系数训练的稳定性和整体性能,在保证系统稳定性的前提下,本文利用精度转化提高系统运算速度的思路,提出了基于广义合成分析(GABS)模型的深度网络结构改善方法,提高了AR系数在含噪环境下估计的准确性和网络训练的稳定性.组合DNN的GABS(GABS-DNN)的模型由三个主要部分组成:修正器的谱增强网络、编码器的...

Keyword ：

深度神经网络深度神经网络广义合成分析广义合成分析 AR系数 AR系数莱文逊-杜宾迭代解莱文逊-杜宾迭代解

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	崔子豪 , 鲍长春 . 基于广义合成分析和深度神经网络的自回归系数估计方法 [J]. \| 电子学报 , 2021 , 49 (01) : 29-39 .
MLA	崔子豪 et al. "基于广义合成分析和深度神经网络的自回归系数估计方法" . \| 电子学报 49 . 01 (2021) : 29-39 .
APA	崔子豪 , 鲍长春 . 基于广义合成分析和深度神经网络的自回归系数估计方法 . \| 电子学报 , 2021 , 49 (01) , 29-39 .
Export to	NoteExpress RIS BibTex

Multi-source localization by using offset residual weight SCIE

期刊论文 | 2021 , 2021 (1) | EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING

Jia, Maoshen | Gao, Shang | Bao, Changchun

WoS CC Cited Count： 2

Abstract&Keyword Cite

Abstract ：

Multiple sound source localization is a hot issue of concern in recent years. The Single Source Zone (SSZ) based localization methods achieve good performance due to the detection and utilization of the Time-Frequency (T-F) zone where only one source is dominant. However, some T-F points consisting of components from multiple sources are also included in the detected SSZ sometimes. Once a T-F point in SSZ is contributed by multiple components, this point is defined as an outlier. The existence of outliers within the detected SSZ is usually an unavoidable problem for SSZ-based methods. To solve this problem, a multi-source localization by using offset residual weight is proposed in this paper. In this method, an assumption is developed: the direction estimated by all the T-F points within the detected SSZ has a difference along with the actual direction of sources. But this difference is much smaller than the difference between the directions estimated by the outliers along with the actual source localization. After verifying this assumption experimentally, Point Offset Residual Weight (PORW) and Source Offset Residual Weight (SORW) are proposed to reduce the influence of outliers on the localization results. Then, a composite weight is formed by combining PORW and SORW, which can effectively distinguish the outliers and desired points. After that, the outliers are removed by composite weight. Finally, a statistical histogram of DOA estimation with outliers removed is used for multi-source localization. The objective evaluation of the proposed method is conducted in various simulated environments. The results show that the proposed method achieves a better performance compared with the reference methods in sources localization.

Keyword ：

Multiple sound sources localization Multiple sound sources localization Direction of arrival estimation Direction of arrival estimation Soundfield microphone Soundfield microphone Reverberation Reverberation

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Jia, Maoshen , Gao, Shang , Bao, Changchun . Multi-source localization by using offset residual weight [J]. \| EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING , 2021 , 2021 (1) .
MLA	Jia, Maoshen et al. "Multi-source localization by using offset residual weight" . \| EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING 2021 . 1 (2021) .
APA	Jia, Maoshen , Gao, Shang , Bao, Changchun . Multi-source localization by using offset residual weight . \| EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING , 2021 , 2021 (1) .
Export to	NoteExpress RIS BibTex

GEV Beamforming with BAN Integrating LPS Estimation and Post-filtering CPCI-S

会议论文 | 2020 | 10th IEEE International Conference on Signal Processing, Communications and Computing (IEEE ICSPCC)

Deng, Shuhao | Bao, Changchun | Cheng, Rui

Abstract&Keyword Cite

Abstract ：

Beamforming method can effectively remove background noise, even in the complex environment, so it is widely used in speech enhancement. We propose a novel Generalized Eigenvalue (GEV) beamforming with Blind Analytic Normalization (BAN) method. In this method, the GEV beamformer coefficients are constructed by estimating logarithmic power spectrum (LPS), which are used to filter multichannel speech signals, and post filter technology is used to further remove noise in the beamformed signals. Firstly, in order to estimate the LPS of speech signal in each channel, we use the data-driven method to train the deep neural network (DNN) model. Then, we use the well trained DNN model to estimate LPS, which is used to calculate the power spectral density (PSD) matrix of speech, and further obtain the coefficients of the GEV beamformer. Since the GEV beamformer will cause speech distortion, the BAN is employed to post-process the beamformed signal. Furthermore, single channel speech enhancement is used to reduce residual noise. Our experiment is conducted in 8-channel simulation data set. The experimental results show that, compared with some existing speech enhancement methods, the proposed method can effectively remove background noise and achieve better speech enhancement effect.

Keyword ：

Blind Analytic Normalization Blind Analytic Normalization Post-filtering Post-filtering Generalized Eigenvalue beamforming Generalized Eigenvalue beamforming Deep Neural Network Deep Neural Network

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Deng, Shuhao , Bao, Changchun , Cheng, Rui . GEV Beamforming with BAN Integrating LPS Estimation and Post-filtering [C] . 2020 .
MLA	Deng, Shuhao et al. "GEV Beamforming with BAN Integrating LPS Estimation and Post-filtering" . (2020) .
APA	Deng, Shuhao , Bao, Changchun , Cheng, Rui . GEV Beamforming with BAN Integrating LPS Estimation and Post-filtering . (2020) .
Export to	NoteExpress RIS BibTex

Multi-channel Speech Enhancement Based on the MVDR Beamformer and Postfilter CPCI-S

会议论文 | 2020 | 10th IEEE International Conference on Signal Processing, Communications and Computing (IEEE ICSPCC)

Wang, Dujuan | Bao, Changchun

Abstract&Keyword Cite

Abstract ：

Deep neural network (DNN) based ideal ratio mask (IRM) estimation methods have yielded good performance in monaural speech enhancement. Meanwhile, these methods have also shown considerable potential for beamforming and multichannel speech enhancement. It is crucial for minimum variance distortionless response (MVDR) beamformer to estimate the covariance matrix of the speech and noise accurately. The accurate estimation of time-frequency (T-F) mask has significant impact on the estimation of the covariance matrices. So, in this paper, a complex real and imaginary ratio mask (CRIRM) based MVDR beamformer for speech enhancement using residual network is proposed. First, the real and imaginary masks of speech and noise are estimated by taking advantage of a residual neural network. After that, the estimations of speech and noise are obtained by using the estimated masks. Finally, the covariance matrices of speech and noise are estimated, and applied into the MVDR beamformer. In addition, in order to further reduce residual noise interference, the output of the MVDR beamformer is further processed by an end-to-end monaural speech enhancement module. Experiments show that, the proposed method can better improve the quality and intelligibility of the enhanced speech.

Keyword ：

residual neural network residual neural network postfilter postfilter speech enhancement speech enhancement beamforming beamforming real and imaginary masks real and imaginary masks

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Wang, Dujuan , Bao, Changchun . Multi-channel Speech Enhancement Based on the MVDR Beamformer and Postfilter [C] . 2020 .
MLA	Wang, Dujuan et al. "Multi-channel Speech Enhancement Based on the MVDR Beamformer and Postfilter" . (2020) .
APA	Wang, Dujuan , Bao, Changchun . Multi-channel Speech Enhancement Based on the MVDR Beamformer and Postfilter . (2020) .
Export to	NoteExpress RIS BibTex

10| 20| 50 per page

< Page ，Total 47 >

Type
Departments

All Years Choose Year From to