Uniformise common terms. Identify the variants of common terms used in the corpus and recode them so that only one form remains. For instance, recode “Second World War”, “2nd World War” and “WW2” as “World_War_2” (note the underscore character ‘_’ in place of spaces). Another example: recode “European Union”, “EU”, “EEC” and “Common Market” as “European_Union”.
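
The recoding described above could be sketched in Python as follows. This is only an illustration under stated assumptions: the names `CANONICAL_FORMS`, `build_recoder` and `recode` are hypothetical, the corpus is assumed to be plain text, and matching is assumed to be case-sensitive and limited to whole words, so that “EU” is not replaced inside a longer word such as “EUROPE”.

```python
import re

# Hypothetical mapping: canonical form -> variants observed in the corpus.
# The entries are the two examples from the text; extend as needed.
CANONICAL_FORMS = {
    "World_War_2": ["Second World War", "2nd World War", "WW2"],
    "European_Union": ["European Union", "EU", "EEC", "Common Market"],
}


def build_recoder(canonical_forms):
    """Return a function that replaces every known variant by its canonical form."""
    # Invert the mapping: variant -> canonical form.
    lookup = {
        variant: canonical
        for canonical, variants in canonical_forms.items()
        for variant in variants
    }
    # Try longer variants first so that a long form is never shadowed
    # by a shorter one at the same position.
    alternatives = sorted(lookup, key=len, reverse=True)
    # The lookarounds require that the match is not glued to another
    # word character (letter, digit or underscore) on either side.
    pattern = re.compile(
        r"(?<!\w)(?:" + "|".join(re.escape(v) for v in alternatives) + r")(?!\w)"
    )

    def recode(text):
        # A single pass over the text: recoded output is never rescanned.
        return pattern.sub(lambda match: lookup[match.group(0)], text)

    return recode


recode = build_recoder(CANONICAL_FORMS)
print(recode("The EEC preceded the EU after the 2nd World War."))
```

Because the canonical forms contain underscores rather than spaces, running the recoder a second time on already recoded text should leave it unchanged. Whether matching should also ignore case (for example “second world war”) depends on the corpus and is left open here.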