Chinese Academy of Sciences Institutional Repositories Grid (中国科学院机构知识库网格)
Cognitive Patterns and Computational Analysis of Readers Reading Texts of Different Structures (读者阅读不同结构语篇的认知模式与计算分析)

Document type: Thesis

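Author: 李琳 (Li Lin)
Defense date: 2022-12
Document subtype: Doctoral
Degree-granting institution: University of Chinese Academy of Sciences
Place of conferral: Institute of Psychology, Chinese Academy of Sciences
Other contributor: 杨玉芳 (Yang Yufang)
Keywords: thematic structure; refutation structure; text comprehension; big data
Degree: Doctor of Science
Degree discipline: Cognitive Neuroscience
Alternative title: Cognitive mechanism during different structural texts and computational analysis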
Abstract (English): Text extraction in computational linguistics often encounters a variety of problems, and the reading patterns of real readers can provide a basis for natural language processing. Readers' cognitive processing is influenced by text structure. This thesis manipulates text structure and combines eye tracking with machine learning to investigate the cognitive mechanisms of text comprehension in two studies (four experiments). Study 1 examined how text structure influences the cognitive processes of text comprehension. Study 2 examined the relationship between text structure and eye-movement measures to further explore the cognitive processes of text reading and their computational modeling.
Study 1 comprised two eye-tracking experiments that revealed the structure effect in text comprehension and its cognitive processes. Experiment 1 manipulated how the topic of a text was represented and investigated the interaction between text structure and a rereading task. The position of the topic sentence influenced readers' eye-movement behavior, and this influence was moderated by the rereading task: readers used text structure strategically during rereading. The results support situation model theory, indicating that readers construct a high-level situation model during comprehension and draw on it in subsequent processing. The rereading effect also supports the context-dependent representation model. Experiment 2 manipulated the thematic and refutation structures of the texts and recorded the eye movements of 68 readers reading 32 structure-manipulated texts. The refutation structure improved reading efficiency: it reduced total reading time and rereading time on misconception areas and increased rereading time on scientific concepts, implying that conceptual change altered readers' online processing strategies. The effect of the refutation structure was moderated by the position of the topic sentence: it was significant only when the topic sentence was in the initial position, not when it was in the final position. This suggests that an initial topic sentence helps readers construct a situation model of the text, and that undergoing conceptual change with a complete situation model in mind reduces cognitive load and reading time.
Study 2 built a discourse classification model with machine learning and verified the structure effect in natural discourse. Experiment 3 took the eye-movement data of readers reading the four text structures in Experiment 2, compressed the eye-movement features with the Lasso algorithm, and used a support vector machine to test predictive performance. Eye-movement data could predict text structure, which supports the findings of Experiment 2 from the reverse direction. This shows that reading is moderated by discourse structure, and the eye-movement measures most sensitive to differences in discourse structure were successfully screened. Experiment 4 collected eye-movement data on 1200 discourses from a Weibo corpus and used cognitive modeling to analyze readers' cognitive processes during reading. ScanMatch was first used to compute the similarity of eye-movement trajectories across clauses of different structures and positions, where a higher similarity score indicates more similar trajectories. A hidden Markov model was then used to model eye-movement trajectories within clauses and to compute the transition matrix. The results show that both the structure and the position of clauses and sentences in natural discourse influence readers' processing.
Taken together, these studies show that text reading is moderated by text structure, which enables readers to read more strategically and improves reading efficiency. The use of machine learning methods improves the reliability and validity of the study and offers new inspiration for future analysis methods in psycholinguistics.
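The feature-compression step described for Experiment 3 can be illustrated with a toy sketch. The abstract does not give the thesis's implementation, so everything here is an assumption made for illustration: the feature names, the four synthetic data rows, the penalty value, and the choice of plain coordinate descent on the objective (1/2n)·||y − Xw||² + λ·||w||₁. The sketch shows only how an L1 penalty drives weakly informative eye-movement features to exactly zero. The subsequent support vector machine classification is not reproduced.

```python
def soft_threshold(rho, lam):
    """Shrink rho toward zero by lam; values within [-lam, lam] become exactly 0."""
    if rho > lam:
        return rho - lam
    if rho < -lam:
        return rho + lam
    return 0.0


def lasso_coordinate_descent(X, y, lam, n_iter=100):
    """Minimise (1/2n)*||y - Xw||^2 + lam*||w||_1 by cyclic coordinate descent."""
    n, p = len(X), len(X[0])
    w = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            rho = 0.0  # correlation of feature j with the partial residual
            z = 0.0    # squared norm of feature j
            for i in range(n):
                partial = sum(X[i][k] * w[k] for k in range(p) if k != j)
                rho += X[i][j] * (y[i] - partial)
                z += X[i][j] ** 2
            w[j] = soft_threshold(rho / n, lam) / (z / n) if z > 0 else 0.0
    return w


# Hypothetical feature names; not the measures used in the thesis.
feature_names = ["total_reading_time", "regression_count", "first_fixation_duration"]

# Synthetic, centred toy values chosen so the result is easy to check by hand.
X = [
    [1.0, 1.2, 1.0],
    [1.0, -0.8, -1.0],
    [-1.0, 0.8, -1.0],
    [-1.0, -1.2, 1.0],
]
# Assumed coding of two structure conditions as +1 / -1.
y = [1.0, 1.0, -1.0, -1.0]

weights = lasso_coordinate_descent(X, y, lam=0.5)
selected = [name for name, w in zip(feature_names, weights) if w != 0.0]
print(selected)
```

In this toy data the first feature tracks the condition code, the second is only weakly related to it, and the third is unrelated, so only the first keeps a nonzero weight. The surviving columns are what a downstream classifier would be trained on.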
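Abstract (translated from Chinese): Text summarization in computing often runs into a variety of problems, and the reading patterns of real readers can provide a basis for natural language processing. Readers' cognitive processing is affected by discourse structure. Starting from discourse structure, this research combines eye tracking, machine learning, and other methods in two studies comprising four experiments to explore the cognitive mechanisms by which readers comprehend discourse. Study 1 examines the influence of discourse structure on the cognitive processes of discourse comprehension. Study 2 uses machine learning and data modeling to examine the relationship between readers' eye movements during reading and discourse structure, further exploring the cognitive processes of discourse reading and their computational modeling.
Study 1 used two eye-tracking experiments to reveal the structure effect in discourse comprehension and its cognitive processes. Experiment 1 manipulated the form in which the topic of a text was represented and examined the interaction between discourse structure and a rereading task, along with the underlying cognitive processes. The position of the topic sentence affected readers' eye-movement trajectories, and this effect was moderated by the rereading task: readers made strategic use of discourse structure when rereading. The results support situation model theory, indicating that readers hold a higher-level situation model when comprehending discourse and draw on it during online processing. The rereading effect also supports the context-dependent representation model. Experiment 2 manipulated the thematic structure and refutation structure of the texts and recorded the eye-movement trajectories of 68 readers reading 32 texts with manipulated discourse structure. The refutation structure improved reading efficiency: it reduced readers' total reading time and rereading time on misconception regions and increased rereading time on scientific concepts, which means that conceptual change altered readers' online processing strategies. The effect of the refutation structure was also moderated by the position of the topic sentence. It was significant only when the topic sentence was in the initial position and not when it was in the final position, indicating that an initial topic sentence helps readers build a situation model of the text. Carrying out conceptual change after the situation model is complete reduces readers' cognitive load and reading time.
Study 2 used machine learning algorithms to build a discourse classification model and verified the effect of discourse structure in natural discourse. Experiment 3 was based on the eye-movement data of readers reading the four discourse structures in Experiment 2. It compressed the eye-movement features with the Lasso algorithm and used a support vector machine to test predictive performance, showing that eye-movement data can predict text structure and supporting the conclusions of Experiment 2 from the reverse direction. This indicates that readers' reading processes are regulated by discourse structure, and the eye-movement measures that are effective for distinguishing discourse structures were successfully selected automatically. Experiment 4 collected eye-movement data from readers reading 1200 discourses in a Weibo corpus and used cognitive modeling to analyze readers' cognitive processes during reading. ScanMatch was first used to compute the similarity of eye-movement trajectories for clauses of different structures and positions, with a higher similarity score indicating more similar trajectories. A hidden Markov model was then used to model eye-movement trajectories within clauses and to compute their dynamic transition matrix. The results show that both the structure and the position of clauses in natural discourse affect readers' processing.
Through these experiments, we found that the process of reading discourse is moderated by discourse structure, which allows readers to read texts more strategically and improves reading efficiency. This research used machine learning and related methods, which improved its reliability and validity and offers new inspiration for future analysis methods in psycholinguistics.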
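The transition matrix mentioned for Experiment 4 can be made concrete with a deliberately simplified sketch. It estimates a first-order Markov transition matrix directly from observed fixation sequences by counting and row-normalising. This is a stand-in, not the thesis's method: in a hidden Markov model the states are latent and are typically fitted with an algorithm such as Baum-Welch, and the abstract does not specify the state definitions or fitting procedure. The region labels and scanpaths below are hypothetical.

```python
from collections import Counter


def transition_matrix(sequences, states):
    """Estimate P(next state | current state) by counting observed transitions."""
    counts = {s: Counter() for s in states}
    for seq in sequences:
        for current, following in zip(seq, seq[1:]):
            counts[current][following] += 1
    matrix = {}
    for s in states:
        total = sum(counts[s].values())
        # Rows with no outgoing transitions are left as all zeros.
        matrix[s] = {t: (counts[s][t] / total if total else 0.0) for t in states}
    return matrix


# Hypothetical coding: each fixation is labelled by the part of the clause it lands on.
states = ["start", "middle", "end"]

# Hypothetical scanpaths, not data from the thesis.
scanpaths = [
    ["start", "middle", "end"],
    ["start", "middle", "start", "middle", "end"],
]

P = transition_matrix(scanpaths, states)
for s in states:
    print(s, P[s])
```

Each row gives the estimated probability of the next fixation region given the current one. Comparing such matrices across clause structures or positions is one way to express differences in within-clause reading patterns.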
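Language: Chinese
Source URL: http://ir.psych.ac.cn/handle/311026/44373
Collection: Institute of Psychology, Laboratory of Cognitive and Developmental Psychology
Recommended citation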
GB/T 7714
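李琳 (Li Lin). 读者阅读不同结构语篇的认知模式与计算分析 [Cognitive Patterns and Computational Analysis of Readers Reading Texts of Different Structures][D]. Institute of Psychology, Chinese Academy of Sciences. University of Chinese Academy of Sciences. 2022.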

Ingest method: OAI harvesting

Source: Institute of Psychology

Unless otherwise stated, all content in this system is protected by copyright, and all rights are reserved.