Conferences in Research and Practice in Information Technology
  

Online Version - Last Updated - 20 Jan 2012

 

 
 
    

Scamseek - A Language Technology Project Fulfilling Research Objectives with Industrial Obligations

Patrick, J.

    The Scamseek project, commissioned by the Australian Securities and Investments Commission (ASIC), had the principal objective of building an industrially viable system that retrieves candidate scam texts from the Internet and classifies them by their potential risk of containing an illegal investment proposal or advice. The value of the system lies in the significant time and efficiency savings it gives the human analyst. However, the classificatory precision needed to discover classes that make up less than 1% of the corpus was considered unachievable by conventional word-based text classification methods, so a hitherto unexplored semantic model of language was adopted to express the feature space of the documents. The project was thus defined in terms of research objectives, in particular the accurate detection of minute classes and a representation for text classification modelled on a strong linguistic theory. At the same time, the project was obliged to produce an industrial-quality system that adhered to the concomitant performance criteria.
Cite as: Patrick, J. (2005). Scamseek - A Language Technology Project Fulfilling Research Objectives with Industrial Obligations. In Proc. South East Asia Regional Computer Confederation (SEARCC) Conference 2005: ICT Building Bridges, Sydney, Australia. CRPIT, 46. Low, G., Ed. ACS. 3-10.
 

 

© Copyright Australian Computer Society Inc. 2001-2014.
Comments should be sent to the webmaster at crpit@scem.uws.edu.au.
This page last updated 16 Nov 2007