Other Abstract | China has a long and glorious history with much of value still to be discovered. History is created by people, so the study of human beings is indispensable to the study of history. When researching history, we should pay close attention to the psychology and behaviour of individuals and groups and to the social and cultural factors behind them. In the past, most historical research took a qualitative approach and seldom used quantitative analysis. Moreover, research on ancient Chinese history requires a strong command of classical Chinese, which is difficult for the general public. With the development of computational linguistics, there are many mature methods and tools for computer analysis of modern Chinese text, but few such tools exist for analysing the meaning of ancient Chinese. LIWC is a dictionary-based word-counting tool focused on the psychological meaning of text. It is well suited to quantitative analysis of sentences and articles and is widely used in research on emotional writing and other language-related areas of psychology. First, we constructed the Ancient Chinese Archive, covering historical books and self-expression articles from the Spring and Autumn period to the Qing dynasty, together with ancient books totalling 1.7 billion characters, as the database for text analysis. In the pretest study, we applied the word-frequency method to classical Chinese texts and found that the results were consistent with existing knowledge of ancient Chinese history. Building on SC-LIWC, Study 1 constructed CC-LIWC and used it for analysis. The outcome was verified by hypothesis testing, which demonstrated that CC-LIWC can detect psychological differences between expressions. In Study 2, we validated CC-LIWC by comparing human scoring with LIWC scoring and attempted to optimise the dictionary. 
The pretest study used the word-frequency method to analyse classical Chinese texts and examined the outcome, providing support for the subsequent studies. It chose Xiào (孝, filial piety) and Lǐ (禮/礼, proper rite) as two representative words of Confucianism in traditional Chinese culture. We analysed how the frequency of these two words in Chinese historical books changed across dynasties and explained the reasons behind the trends. Because filial piety is part of proper rite in Confucianism, we also calculated the correlation coefficient between the frequencies of Xiào and Lǐ and found a moderate correlation. Study 1 includes two experiments. Experiment 1 obtained all word entries and their explanations from an online Chinese dictionary and mapped SC-LIWC words to classical Chinese words, which yielded candidate words for the CC-LIWC dictionary; we then checked the candidates manually and removed words that were not actually mapped; finally, we double-checked the result and generated the CC-LIWC dictionary, which contains 79 categories and 49,136 classical Chinese entries. Experiment 2 used CC-LIWC to analyse the self-expression texts of Confucius and Han Feizi separately and examined the differences between them. The results showed significant differences between Confucius and Han Feizi in emotional processes, cognitive processes, drives, etc., which pointed to differences in the core values of Confucianism and Legalism. Study 2 includes two experiments. Experiment 3 aimed to validate CC-LIWC. We selected 7 psychological dimensions of CC-LIWC, invited 8 coders to rate 35 classical Chinese texts on those 7 dimensions, and finally calculated the correlation coefficient between the human scores and the LIWC scores. 
Experiment 4 focused on solving the problems identified and proposed a solution for optimising the CC-LIWC dictionary. To analyse the psychological meaning of classical Chinese texts, this paper constructed the CC-LIWC dictionary, used it to analyse ancient texts, evaluated its validity, and finally optimised it to improve that validity. The results show that the correlation between CC-LIWC scores and human scores is within the acceptable range, and some dimensions even reach a medium-to-high correlation (p < 0.01). We conclude that CC-LIWC has considerable validity for analysing the psychological meaning of classical Chinese texts. |
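
The dictionary-based scoring and the human-versus-LIWC comparison summarised in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy dictionary `TOY_DICT`, its category labels and entries, the forward longest-match tokenisation, and the function names `category_scores` and `pearson` are all assumptions made for illustration. The abstract does not say how CC-LIWC segments classical Chinese text or which correlation coefficient was used; Pearson's r is assumed here.

```python
from math import sqrt

# Hypothetical toy dictionary (category -> entries). The real CC-LIWC is
# reported to hold 79 categories and 49,136 entries; these are placeholders.
TOY_DICT = {
    "posemo": {"樂", "喜", "悅"},
    "negemo": {"憂", "怒", "哀"},
}


def category_scores(text, dictionary):
    """Return, per category, the share of tokens matching a dictionary entry.

    Assumption: tokens are found by forward longest-match against all
    entries; a character that starts no match counts as a one-character
    token. The text is assumed to be stripped of punctuation beforehand.
    """
    # An entry may belong to several categories, so map entry -> categories.
    word_cats = {}
    for cat, words in dictionary.items():
        for word in words:
            word_cats.setdefault(word, []).append(cat)
    max_len = max((len(w) for w in word_cats), default=1)

    counts = {cat: 0 for cat in dictionary}
    total = 0
    i = 0
    while i < len(text):
        step = 1
        # Try the longest candidate first, then shorter ones.
        for n in range(min(max_len, len(text) - i), 0, -1):
            cats = word_cats.get(text[i:i + n])
            if cats:
                for cat in cats:
                    counts[cat] += 1
                step = n
                break
        total += 1
        i += step
    return {cat: (counts[cat] / total if total else 0.0) for cat in counts}


def pearson(xs, ys):
    """Pearson correlation between two equal-length score lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)
```

Under these assumptions, a validation step like the one in Experiment 3 would compute `category_scores` for each text, collect the scores for one dimension into a list, and pass that list to `pearson` together with the averaged coder ratings for the same texts.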