Conferences in Research and Practice in Information Technology





Scamseek - A Language Technology Project Fulfilling Research Objectives with Industrial Obligations

Patrick, J.

    The Scamseek project, commissioned by the Australian Securities and Investments Commission (ASIC), had the principal objective of building an industrially viable system that retrieves candidate scam texts from the Internet and classifies them according to their potential risk of containing an illegal investment proposal or advice. The value of the system lies in the significant time and efficiency savings it offers the human analyst. However, precise detection of classes that make up less than 1% of the corpus was considered unachievable with conventional word-based text classification methods, so a hitherto unexplored semantic model of language was adopted to express the feature space of the documents. The project was thus defined in terms of research objectives, particularly the accurate detection of minute classes and the representation of text classification as modelled through a strong linguistic theory. At the same time, the project was obliged to produce an industrial-quality system that adhered to the corresponding performance criteria.
Cite as: Patrick, J. (2005). Scamseek - A Language Technology Project Fulfilling Research Objectives with Industrial Obligations. In Proc. South East Asia Regional Computer Confederation (SEARCC) Conference 2005: ICT Building Bridges, Sydney, Australia. CRPIT, 46. Low, G., Ed. ACS. 3-10.


© Copyright Australian Computer Society Inc. 2001-2014.