Institutional Repository, Institute of Psychology, Chinese Academy of Sciences
Research on Recognition Algorithm of Main Speakers in Meeting Based on Speed Difference | |
Guo, Honglian1,2; Zhao, Ke1,2 | |
2023 | |
通讯作者邮箱 | [email protected] (k. zhao) |
会议名称 | Proceedings of the 3rd IEEE International Conference on Social Sciences and Intelligence Management, SSIM 2023 |
会议录名称 | Proceedings of the 3rd IEEE International Conference on Social Sciences and Intelligence Management |
页码 | 271-274 |
会议日期 | 2023 |
会议地点 | 不详 |
摘要 | Speech recognition based on speed difference and psychological effect on the speakers' performance is a complex problem involving speech processing and psychological analysis. In this study, we aimed to recognize the main speaker in a meeting by analyzing the speaker's speech characteristics, speed difference, and sense of agency (SoA). SoA refers to an individual's perception of control or agency over their actions and outcomes. To achieve this, we proposed a technology that detects the main speaker by analyzing the speech rate consistency among speakers. Typically, the main speaker in a meeting maintains a consistent speech rate, while other speakers may vary their speech rate. We employed a sliding window approach to segment the continuous speech stream and estimated the speed of each segment to generate a speed curve. By identifying the local minimum on this curve, we determined the changepoints used by different speakers in turn. Next, we considered the segments between two adjacent changepoints whose speed was lower than a certain threshold. These segments were accurately identified and classified as belonging to the main speaker. Experimental results demonstrated that our method improved the recognition of speakers by analyzing speech characteristics, speed difference, and SoA, as well as assessing the perception of control or agency over speaking actions and outcomes. The performance of our method was improved by approximately 30% compared to traditional methods. In conclusion, our research contributes to the field of speech recognition by incorporating psychological factors such as SoA and speed difference in the identification of main speakers in meetings. This approach enhances the accuracy and efficiency of speaker recognition systems, leading to improved performance in various applications. |
DOI | 10.1109/SSIM59263.2023.10469636 |
收录类别 | EI |
语种 | 英语 |
引用统计 | |
文献类型 | 会议论文 |
条目标识符 | http://ir.psych.ac.cn/handle/311026/47465 |
专题 | 脑与认知科学国家重点实验室 |
作者单位 | 1.State Key Laboratory of Brain and Cognitive Science, Institute of Psychology, Chinese Academy of Sciences, Beijing, China 2.University of Chinese Academy of Sciences, Department of Psychology, Beijing, China |
推荐引用方式 GB/T 7714 | Guo, Honglian,Zhao, Ke. Research on Recognition Algorithm of Main Speakers in Meeting Based on Speed Difference[C],2023:271-274. |
条目包含的文件 | 条目无相关文件。 |
个性服务 |
推荐该条目 |
保存到收藏夹 |
查看访问统计 |
导出为Endnote文件 |
谷歌学术 |
谷歌学术中相似的文章 |
[Guo, Honglian]的文章 |
[Zhao, Ke]的文章 |
百度学术 |
百度学术中相似的文章 |
[Guo, Honglian]的文章 |
[Zhao, Ke]的文章 |
必应学术 |
必应学术中相似的文章 |
[Guo, Honglian]的文章 |
[Zhao, Ke]的文章 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论