Corpus Applications of Regular Expressions
正則表達式的語料庫應用
2025/08/25 | 19:00-20:00
Prof. LU Dawei (盧達威), School of Liberal Arts, Renmin University of China (中國人民大學)
Abstract:
Corpora are an important resource for linguistic research. Regular expressions, a key text-matching tool, enable precise corpus searches. In text editors that support them, regular expressions also help organize raw text, which aids in building text corpora, and they can support corpus annotation as well. This lecture first introduces the basic concepts, principles, and matching rules of regular expressions and then, using EmEditor as an example, demonstrates their application to corpus searching, construction, and annotation, giving beginners a practical set of corpus tools.
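For readers who want a feel for the three uses named in the abstract before the lecture, here is a minimal sketch. It is not the speaker's material: the lecture demonstrates EmEditor, whereas this sketch substitutes Python's standard `re` module, and the sample sentences, the function names (`tidy`, `search`, `annotate`), and the `<N>` tag are all hypothetical choices made for illustration.

```python
import re

# Hypothetical mini corpus with untidy spacing and blank lines (not from the lecture).
RAW = "The  cat sat on the mat.\n\n\nA dog   chased the cat.\nCats sleep a lot.\n"


def tidy(text):
    """Construction: collapse runs of spaces/tabs and drop blank lines."""
    text = re.sub(r"[ \t]+", " ", text)
    lines = [line.strip() for line in text.splitlines()]
    return [line for line in lines if line]


def search(lines, pattern):
    """Searching: return (line_number, matched_text) for every hit."""
    rx = re.compile(pattern, re.IGNORECASE)
    return [
        (number, match.group(0))
        for number, line in enumerate(lines, start=1)
        for match in rx.finditer(line)
    ]


def annotate(line, pattern, tag):
    """Annotation: wrap each match in an XML-style tag (tag name is illustrative)."""
    return re.sub(
        pattern,
        lambda m: f"<{tag}>{m.group(0)}</{tag}>",
        line,
        flags=re.IGNORECASE,
    )


# \bcats?\b matches "cat" or "cats" as whole words, ignoring case.
lines = tidy(RAW)
hits = search(lines, r"\bcats?\b")
tagged = [annotate(line, r"\bcats?\b", "N") for line in lines]

for number, text in hits:
    print(number, text)
for line in tagged:
    print(line)
```

The same patterns can usually be typed into an editor's find-and-replace dialog with regular expressions enabled, although syntax details such as backreferences and flags vary between tools.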