Towards a Corpus of Historical Australian English ego-documents
A brief survey for linguists

Australian English should be seen as a major player among Late Modern Englishes, but there is a shortage of linguistic data for studying its evolution in the 19th and early 20th centuries. This survey invites advice from linguists about the most useful parameters for developing a representative corpus of historical Australian English based on ego-documents (letters, diaries, memoirs) written by people who were born in Australia or had arrived there in early childhood (<10 years of age).

This survey is based on findings of a pilot study of letters written to Government Botanist Ferdinand Mueller at the Royal Botanic Gardens in Melbourne between 1860 and 1898. That study identified 132 letters written by 87 Australian-born correspondents, with an average word count of 180. Other collections have been identified in Western Australia and Queensland which may yield 5,000 or more letters, and public and private collections in Tasmania and South Australia and New South Wales are also being investigated.

Given this range of materials to be drawn upon, this survey asks 8 questions about the scope of the corpus: what period of time it should encompass (based on writer/speaker year of birth), how many writer/speakers it should seek to include, and what would be an acceptable minimum word count. It should take a maximum of 10 minutes to complete.

Further comments or questions are welcome to madeleine.clews@uwa.edu.au
1.The Late Modern Era (LME) in World Englishes has a range of suggested starting points from 1700 (Beal) to 1800 (Algeo & Pyles).

The British colonisation of Australia has a clear starting point of the arrival of the First Fleet in 1788 and the creation of New South Wales. The final two colonies of Western Australia and South Australia were established more than two generations later, in 1829 and 1836 respectively.

When should the timespan for a corpus of historical AusE as a LME (based on identified speaker birth-year) begin?
2.If you selected 'Other' as your answer to the question above, what do you think the corpus starting point for speaker birth year should be, and why?
3.Suggested endpoints for the LME era range from 1945 (Beal) to 2000 (Algeo & Pyles) to still in progress (Culpeper). When should the timespan for a historical corpus of AusE as a LME (based on identified speaker birth-year) end?
4.If you selected 'Other' as your answer to the question above, what do you think the the corpus end point for speaker birth year should be, and why?
5.Assuming a reasonable geographic coverage of specific birthplaces around the Australian states is achieved, what do you consider to be an acceptable minimum number of individual writer/speakers (born in the colonies) in the corpus?
6.The average length of a letter in the pilot survey was found to be 180 words. It is likely that multiple letters from an individual speaker may be available. Do you consider that there should be a maximum number of words per individual letter-writer included in the corpus?
7.If you selected 'other' as your answer for the question above, please elaborate here.
8.What do you consider to be an acceptable minimum wordcount for the proposed corpus?
Thank you for completing the survey. Questions and comments are welcome to madeleine.clews@uwa.edu.au