PSYCH OpenIR  > 脑与认知科学国家重点实验室
Research on Recognition Algorithm of Main Speakers in Meeting Based on Speed Difference
Guo, Honglian1,2; Zhao, Ke1,2
2023
通讯作者邮箱[email protected] (k. zhao)
会议名称Proceedings of the 3rd IEEE International Conference on Social Sciences and Intelligence Management, SSIM 2023
会议录名称Proceedings of the 3rd IEEE International Conference on Social Sciences and Intelligence Management
页码271-274
会议日期2023
会议地点不详
摘要

Speech recognition based on speed difference and psychological effect on the speakers' performance is a complex problem involving speech processing and psychological analysis. In this study, we aimed to recognize the main speaker in a meeting by analyzing the speaker's speech characteristics, speed difference, and sense of agency (SoA). SoA refers to an individual's perception of control or agency over their actions and outcomes. To achieve this, we proposed a technology that detects the main speaker by analyzing the speech rate consistency among speakers. Typically, the main speaker in a meeting maintains a consistent speech rate, while other speakers may vary their speech rate. We employed a sliding window approach to segment the continuous speech stream and estimated the speed of each segment to generate a speed curve. By identifying the local minimum on this curve, we determined the changepoints used by different speakers in turn. Next, we considered the segments between two adjacent changepoints whose speed was lower than a certain threshold. These segments were accurately identified and classified as belonging to the main speaker. Experimental results demonstrated that our method improved the recognition of speakers by analyzing speech characteristics, speed difference, and SoA, as well as assessing the perception of control or agency over speaking actions and outcomes. The performance of our method was improved by approximately 30% compared to traditional methods. In conclusion, our research contributes to the field of speech recognition by incorporating psychological factors such as SoA and speed difference in the identification of main speakers in meetings. This approach enhances the accuracy and efficiency of speaker recognition systems, leading to improved performance in various applications.

DOI10.1109/SSIM59263.2023.10469636
收录类别EI
语种英语
引用统计
文献类型会议论文
条目标识符http://ir.psych.ac.cn/handle/311026/47465
专题脑与认知科学国家重点实验室
作者单位1.State Key Laboratory of Brain and Cognitive Science, Institute of Psychology, Chinese Academy of Sciences, Beijing, China
2.University of Chinese Academy of Sciences, Department of Psychology, Beijing, China
推荐引用方式
GB/T 7714
Guo, Honglian,Zhao, Ke. Research on Recognition Algorithm of Main Speakers in Meeting Based on Speed Difference[C],2023:271-274.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Guo, Honglian]的文章
[Zhao, Ke]的文章
百度学术
百度学术中相似的文章
[Guo, Honglian]的文章
[Zhao, Ke]的文章
必应学术
必应学术中相似的文章
[Guo, Honglian]的文章
[Zhao, Ke]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。