1. Validity
Messick, S. (1995). Validity of psychological assessment: Validation of inferences from persons’ responses and performances as scientific inquiry into score meaning. American Psychologist, 50(9), 741–749.
2. Validity in the language classroom
Moss, P. (2003). Reconceptualizing validity for classroom assessment. Educational Measurement: Issues and Practice, 22(4), 13–25.
3. Reliability
Chapelle, C. A. (2013). Reliability in language assessment. In C. A. Chapelle (Ed.), The encyclopaedia of applied linguistics (pp. 1–6). Chichester, UK: Blackwell Publishing.
Alderson, J. C., Clapham, C., & Wall, D. (1995). Language test construction and evaluation. Cambridge, UK: Cambridge University Press. (pp. 87–89)
4. Test specifications/blueprints
Bachman, L. F., & Palmer, A. S. (2010). Language assessment in practice. Oxford, UK: Oxford University Press. (pp. 368–382)
5. Language use and language ability
Bachman, L. F., & Palmer, A. S. (2010). Language assessment in practice. Oxford, UK: Oxford University Press. (pp. 33–58)
6. Kinds of tests and testing
Hughes, A. (2003). Testing for language teachers (2nd ed.). Cambridge, UK: Cambridge University Press. (pp. 11–25)
7. Describing characteristics of language use and language assessment tasks
Bachman, L. F., & Palmer, A. S. (2010). Language assessment in practice. Oxford, UK: Oxford University Press. (pp. 59–84)
8. Developing assessment tasks
Bachman, L. F., & Palmer, A. S. (2010). Language assessment in practice. Oxford, UK: Oxford University Press. (pp. 306–320)
9. Summative and formative assessment
Weir, C. J. (2001). The formative and summative uses of language test data: Present concerns and future directions. In C. Elder & A. Davies (Eds.), Experimenting with uncertainty: Essays in honour of Alan Davies (pp. 117–125). Cambridge, UK: Cambridge University Press.
10. Washback
Alderson, J. C., & Wall, D. (1993). Does washback exist? Applied Linguistics, 14, 115–129.
11. Prototypes, prototyping and field tests
Fulcher, G., & Davidson, F. (2007). Language testing and assessment: An advanced resource book. London, UK: Routledge. (pp. 76–89)
12. Classical item analysis
Alderson, J. C., Clapham, C., & Wall, D. (1995). Language test construction and evaluation. Cambridge, UK: Cambridge University Press. (pp. 80–86)
Fulcher, G., & Davidson, F. (2007). Language testing and assessment: An advanced resource book. London, UK: Routledge. (pp. 101–104)