版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進行舉報或認(rèn)領(lǐng)
文檔簡介
1、語料庫研究基本方法,中國外語教育研究中心 梁茂成,主要內(nèi)容,語料庫語言學(xué)的性質(zhì) 幾個常用術(shù)語 語料庫研究的基本方法,語料庫語言學(xué)的性質(zhì),理性主義與經(jīng)驗主義 Rationalism: I think therefore I am. Empiricism: My mind is a blank slate. Seeing is believing.,語料庫語言學(xué)的性質(zhì),the Wax Argument: He considers a piece of wax; his senses inform him that it has certain characteristics, such as sh
2、ape, texture, size, color, smell, and so forth. When he brings the wax towards a flame, these characteristics change completely. However, it seems that it is still the same thing: it is still a piece of wax, even though the data of the senses inform him that all of its characteristics are different.
3、,語料庫語言學(xué)的性質(zhì),the Wax Argument: Therefore, in order to properly grasp the nature of the wax, he cannot use the senses. He must use his mind. Descartes concludes: “ And so something which I thought I was seeing with my eyes is in fact grasped solely by the faculty of judgment which is in my mind.,語料庫語言學(xué)
4、的性質(zhì),Empiricism: Empiricism emphasizes those aspects of scientific knowledge that are closely related to evidence, especially as discovered in experiments. It is a fundamental part of the scientific method that all hypotheses and theories must be tested against observations of the natural world, rath
5、er than resting solely on reasoning and intuition.,語料庫語言學(xué)的性質(zhì),Science is considered to be methodologically empirical in nature. Corpus linguistics is empirical in nature.,語料庫語言學(xué)的性質(zhì),語言研究中的數(shù)據(jù)類型 內(nèi)省數(shù)據(jù)(introspective data): rationalism 實驗數(shù)據(jù)(experimental data): empiricism 真實數(shù)據(jù)(anthentic data): empricism,語料庫
6、語言學(xué)的性質(zhì),語料庫語言學(xué)提倡真實數(shù)據(jù) 我們不排斥其他數(shù)據(jù)類型,語料庫語言學(xué)的性質(zhì),即便在語料庫語言學(xué)陣營之中 Corpus-driven: minimum theory-reliance. Exclusive reliance on corpus data for all theories Corpus-based: Reliance on corpus data for hypothesis-testing Corpus-referenced/informed: Occasionally resorting to corpus data for illustrations,語料庫語言學(xué)的性
7、質(zhì),我們堅決反對不顧語言事實的任何論斷 No introspection can claim credence without verification through real language data (Teubert 2005).,幾個常用術(shù)語,Corpus Corpus linguistics,幾個常用術(shù)語,Token, type, lemma The little boy looked at the other boys.,幾個常用術(shù)語,Collocation is defined as a sequence of words which co-occur more often t
8、han would be expected by chance. a big smoker a strong smoker a hard smoker a heavy smoker a furious smoker,幾個常用術(shù)語,It is quite possible, in fact, to describe a woman as handsome. However, this implies that she is not beautiful at all in the traditional sense of female beauty, but rather that she is
9、mature in age, has large features and a certain strength of character. Similarly, a man could be described as beautiful, but this would usually imply that he had feminine features.,幾個常用術(shù)語,Colligation is defined as a sequence of grammatical categories which co-occur more often than would be expected
10、by chance.,幾個常用術(shù)語,Semantic prosody is instantiated when a word such as CAUSE co-occurs regularly with words that share a given meaning or meanings, and then acquires some of the meaning(s) of those words as a result. This acquired meaning is known as semantic prosody. (Stewart 2010),語料庫研究的基本方法,Corpu
11、s-based approach: a hypothesis-testing approach Corpus-driven approach: with as “few preconceived ideas” as possible, “keeping the amount of theory-reliance to a minimum in order not to hinder the process of discovering new phenomena” (Rmer 2005),語料庫研究的基本方法,Both approaches almost always involve a co
12、mparion of some kind.,語料庫研究的基本方法,Sizes of corpora in comparison (Rayson 2003) Small big Equal sizes,語料庫研究的基本方法,Types of comparison Across genres Across users Across different times Across (varieties of) language(s),語料庫研究的基本方法,Corpus comparability,語料庫研究的基本方法,Linguistic features in corpus comparison Lexical Lexico-grammatical Syntactic Discoursal,語料庫研究的基本方法,Statistic tests in corpus comparison Simple: Relationship (correlation, etc) Diff
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。
最新文檔
- 《生物安全管理要求》課件
- 《生物質(zhì)碳化技術(shù)》課件
- 2025年宇宙生命之謎
- 2024-2025學(xué)年浙江省麗水市“五校高中發(fā)展共同體”高一上學(xué)期10月聯(lián)考?xì)v史試題(解析版)
- 單位管理制度集粹匯編【員工管理篇】
- 2025年高考數(shù)學(xué)一輪復(fù)習(xí)之常用邏輯用語
- 單位管理制度匯編大合集【員工管理】十篇
- 單位管理制度合并匯編職工管理十篇
- 2024春節(jié)放假安全風(fēng)險應(yīng)急預(yù)案范文(32篇)
- 《穴盤育苗技術(shù)》課件
- 針灸推拿學(xué)100512練習(xí)題庫與參考答案
- 常用截面慣性矩與截面系數(shù)的計算
- 行車工考試試題
- 小兒頭皮靜脈輸液課件
- 電力電纜高頻局放試驗報告
- 肺病科主任年度述職匯報
- 2023年福建省晉江市數(shù)學(xué)七年級第一學(xué)期期末學(xué)業(yè)質(zhì)量監(jiān)測試題含解析
- 水利水電工程基礎(chǔ)坑隱蔽工程驗收證書
- 余熱發(fā)電工程總施工組織設(shè)計方案
- 建設(shè)工程監(jiān)理費計算器(免費)
- 希望點-列舉法
評論
0/150
提交評論