This survey is part of the preparation for the 9th Web as Corpus Workshop (WAC-9), co-located with EACL 2014 at Gothenburg and endorsed by the Special Interest Group of the ACL on Web as Corpus (SIGWAC). WAC-9 is organized by Felix Bildhauer and Roland Schäfer of Freie Universität, Berlin.

http://www.sigwac.org.uk/wiki/WAC9

We would like to ask you to answer a few questions related to a planned panel discussion about a potential shared task on the quality of NLP tools (in the broadest sense) applied to web data. The shared task is intended to follow the earlier CLEANEVAL shared task on web document processing, and it would be part of one of the future WAC workshops. Since we can only have a panel discussion and a shared task if there is significant interest, we would be grateful if you could fill out this form and let us know what you think. It will only take a minute or two. The information you provide will be used only for the purpose of organizing the planned panel discussion of a potential shared task at WAC-9. It will not be shared with any third party.

* 1. Please specify your name or at least a pseudonym.

* 2. Please specify your email address. This is required only if you volunteer as a panelist for the panel discussion at WAC-9 in Question 5.

* 3. Are you going to attend WAC-9 at EACL 2014 in Gotheburg? (Note: This is NOT a registration, and you are not committing to anything at this point.)

* 4. Do you think that WAC-10 and/or WAC-11 should include a shared task about the quality of web document post-processing (such as boilerplate removal, lemmatization, POS tagging, etc.)?

——— Note on Questions 5–8: By registering your intention to participate in the discussion and/or by suggesting tracks and languages for the shared task, you do NOT volunteer to do any actual work related to the shared task, yet. We are simply trying to assess the overall interest in a shared task on web document processing. ———

* 5. Would you participate in a panel discussion about such a shared task during WAC-9 at Gothenburg on April 26, 2014?

* 6. What are the tracks of a potential shared task that you would like to see?

* 7. Other suggested tracks for a potential shared task?

* 8. Which languages should be dealt with in a potential shared task? (Please specify only languages on which you have done NLP/corpus construction work yourself.)

T