Student research opportunities

Map of Measures to Improve Reliability in Technology Evaluation

Project Code: CECS_1133

This project is available at the following levels:
Honours, Masters, PhD

Keywords:

Evaluation Measures; Performance Evaluation; Statistical Machine Learning

Supervisor:

Assoc Professor Hanna Suominen

Outline:

The project studies measures that are used in statistical machine learning, artificial intelligence, and document analysis to evaluate how correctly methods process their input. Although choosing the right measure is crucial to reliable results, the connections between the many available measures are unclear and, confusingly, the same or almost the same measure often goes by many names. Cataloguing and relating the measures, and discussing their pros and cons, not only promotes the use of the right measures but also improves understanding of technological similarities and differences.

For example, in binary classification for HIV coding, the most common measure, accuracy (the proportion of correct codes among all codes), gives misleading results. Because 99.989 per cent of Australians are HIV-negative, a coder that always assigns negative is 99.989 per cent accurate yet never identifies a single HIV-positive patient. Even when the evaluation is refined to consider true positives and true negatives separately, the probability that a patient has HIV given a positive code is only about 52 per cent if the coder correctly codes 99.9 per cent of HIV-positive patients and 99.99 per cent of HIV-negative patients. Perfect sensitivity (the proportion of true positives among all codes that should have been positive, a.k.a. recall, true positive rate, hit rate, power, and 1 – false negative rate) is trivially achieved by coding everything as positive, but at the expense of precision (the proportion of true positives among all codes that the automated coder assigned as positive).
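The arithmetic behind this example can be checked with a short sketch. The function names and the population of one million people are illustrative assumptions, not part of the project description; only the three percentages come from the text above.

```python
# Figures quoted in the text above.
PREVALENCE = 1 - 0.99989   # share of HIV-positive people (0.011 per cent)
SENSITIVITY = 0.999        # share of HIV-positive patients coded as positive
SPECIFICITY = 0.9999       # share of HIV-negative patients coded as negative


def accuracy(tp, fp, tn, fn):
    """Proportion of correct codes among all codes."""
    return (tp + tn) / (tp + fp + tn + fn)


def sensitivity(tp, fn):
    """Proportion of true positives among all cases that should be positive."""
    return tp / (tp + fn)


def precision(tp, fp):
    """Proportion of true positives among all cases coded as positive."""
    return tp / (tp + fp)


def positive_predictive_value(prevalence, sens, spec):
    """P(condition | positive code), computed with Bayes' rule."""
    true_pos = sens * prevalence
    false_pos = (1 - spec) * (1 - prevalence)
    return true_pos / (true_pos + false_pos)


# Hypothetical population (an assumption made only for illustration).
population = 1_000_000
positives = round(population * PREVALENCE)
negatives = population - positives

# A coder that always assigns "negative": no positive codes at all.
acc_always_negative = accuracy(tp=0, fp=0, tn=negatives, fn=positives)

# A coder that always assigns "positive": no negative codes at all.
sens_always_positive = sensitivity(tp=positives, fn=0)
prec_always_positive = precision(tp=positives, fp=negatives)

# A coder with the sensitivity and specificity quoted in the text.
ppv = positive_predictive_value(PREVALENCE, SENSITIVITY, SPECIFICITY)

print(f"accuracy, always negative:    {acc_always_negative:.5f}")
print(f"sensitivity, always positive: {sens_always_positive:.5f}")
print(f"precision, always positive:   {prec_always_positive:.5f}")
print(f"P(HIV | positive code):       {ppv:.4f}")
```

With these exact figures the last value comes out at roughly 0.52, that is, only about one positive code in two is correct, even though both per-class rates exceed 99.9 per cent. The always-positive coder reaches a sensitivity of 1 while its precision drops to the prevalence itself.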

Goals of this project

Catalogue of measures with their mathematical relations specified
Discussion of the pros and cons of the measures with empirical results to illustrate theoretical conclusions

Requirements/Prerequisites

Solid programming skills, preferably using Matlab, Java, or Python

Successful completion of one or more of the ANU courses Artificial Intelligence, Document Analysis, and Introduction to Statistical Machine Learning

Links

Artificial Intelligence
Document Analysis
Introduction to Statistical Machine Learning


Updated: 14 May 2015