Contents // Acknowledgments // vin // 1. Introduction // 1.1 Why Another Introduction to Corpus Linguistics? // 1.2 Outline of the Book // 1.3 Recommendation for Instructors // 2. The Three Central Corpus-linguistic Methods // 2.1 Corpora // 2.1.1 What is a Corpus? // 2.1.2 What Kinds of Corpora are There? // 2.2 Frequency Lists // 2.3 Lexical Co-occurrence: Collocations // 2.4 (Lexico-)Grammatical Co-occurrence: Concordances // 3. An Introduction to R // 3.1 A Few Central Notions: Data Structures, Functions, and Arguments // 3.2 Vectors // 3.2.1 Basics // 3.2.2 Loading Vectors // 3.2.3 Accessing and Processing (Parts of) Vectors // 3.2.4 Saving Vectors // 3.3 Factors // 3.4 Data Frames // 3.4.1 Generating Data Frames // 3.4.2 Loading and Saving Data Frames // 3.4.3 Accessing and Processing (Parts of) Data Frames // 3.5 Lists // 3.6 Elementary Programming Functions // 3.6.1 Conditional Expressions // 3.6.2 Loops // 3.6.3 Rules of Programming // 3.7 Character/String Processing // 3.7.1 Getting Information from and Accessing (Vectors of) Character Strings // 3.7.2 Elementary Ways to Change (Vectors of) Character Strings // 3.7.3 Merging and Splitting (Vectors of) Character Strings without Regular Expressions // 3.7.4 Searching and Replacing without Regular Expressions // 3.7.5 Searching and Replacing with Regular Expressions // 1 // 1 // 4 // 5 // 7 // 12 // 14 // 16 // 19 // 23 // 28 // 28 // 32 // 35 // 42 // 43 // 44 44 46 48 53 59 // 59 // 60 64 68 // 69 // 70 // 70 // 72
79 // v // so VJ // vi • Contents // 4. // 5. // 3.7.6 Merging and Splitting (Vectors of) Character Strings with Regular Expressions // 3.8 File and Directory Operations // Using R in Corpus Linguistics 4.1 Frequency Lists // 4.1.1 // 4.1.2 // 4.1.3 // 4.1.4 // 4.1.5 // 4.1.6 // 4.1.7 // A Frequency List of an Unannotated Corpus A Reverse Frequency List of an Unannotated Corpus A Frequency List of an Annotated Corpus A Frequency List of Tag-word Sequences from an Annotated Corpus // A Frequency List of Word Pairs from an Annotated Corpus A Frequency List of an Annotated Corpus (with One Word Per // A Frequency List of Word Pairs of an Annotated Corpus (with One Word Per Line) // 4.2 // Concordances // 4.2.1 // 4.2.2 // 4.2.3 // 4.2.4 // A Concordance of an Unannotated Text File // A Simple Concordance from Files of a POS-tagged (SGML) Corpus More Complex Concordances from Files of a POS-tagged (SGML) // A Lemma-based Concordance from Files of a POS-tagged and Lemmatized (XML) Corpus // 4.3 Collocations // 4.4 Excursus 1: Processing Multi-tiered Corpora // 4.5 Excursus 2: Unicode // 4.5.1 Frequency Lists // 4.5.2 Concordancing // Some Statistics for Corpus Linguistics // 5.1 Introduction to Statistical Thinking // 5.1.1 Variables and their Roles in an Analysis // 5.1.2 Variables and their Information Value // 5.1.3 Hypotheses: Formulation and Operationalization // 5.1.4 Data Analysis // 5.1.5 Hypothesis (and Significance) Testing // 5.2 Categorical Dependent Variables // 5.2.1 One
Categorical Dependent Variable, No Independent Variable // 5.2.2 One Categorical Dependent Variable, One Categorical Independent // 5.2.3 One Categorical Dependent Variable, 2+ Independent Variables // 5.3 Interval/Ratio-scaled Dependent Variables // 5.3.1 Descriptive Statistics for Interval/Ratio-scaled Dependent // 5.3.2 // Independent Variable // 5.3.3 One Interval/Ratio-scaled Dependent Variable, One Interval/Ratio-scaled Independent Variable // 5.3.4 One Interval/Ratio-scaled Dependent Variable, 2+ Independent Variables // 5.4 Customizing Statistical Plots // 5.5 Reporting Results // Variables // One Interval/Ratio-scaled Dependent Variable, One Categorical // 96 // 99 // 105 // 106 // 106 // no // 112 // 114 // 118 // 124 // 126 // 127 // 127 // 135 // 141 // 146 // 149 // 156 // 166 // 167 // 169 // 173 // 174 // 174 // 174 // 176 // 182 // 183 // 189 // 189 // 192 // 200 // 201 // 201 // 205 // 211 // 214 // 215 215 // Contents • vii // 6. Case Studies and Pointers to Other Applications 219 // 6.1 Introduction to the Case Studies 219 // 6.2 Some Pointers to Further Applications 220 // Appendix 225 // References 229 // Endnotes 237 // Index 243