Unbounded length contexts for PPM JG Cleary, WJ Teahan The Computer Journal 40 (2_and_3), 67-75, 1997 | 518 | 1997 |
A compression-based algorithm for Chinese word segmentation WJ Teahan, Y Wen, R McNab, IH Witten Computational Linguistics 26 (3), 375-393, 2000 | 213 | 2000 |
Using compression-based language models for text categorization WJ Teahan, DJ Harper Language modeling for information retrieval, 141-165, 2003 | 184 | 2003 |
A repetition based measure for verification of text collections and for text categorization DV Khmelev, WJ Teahan Proceedings of the 26th annual international ACM SIGIR conference on …, 2003 | 132 | 2003 |
Text classification and segmentation using minimum cross-entropy WJ Teahan Content-Based Multimedia Information Access-Volume 2, 943-961, 2000 | 128 | 2000 |
Modelling english text WJ Teahan University of Waikato, 1998 | 113 | 1998 |
The entropy of English using PPM-based models WJ Teahan, JG Cleary Proceedings of Data Compression Conference-DCC'96, 53-62, 1996 | 107 | 1996 |
Text mining: A new frontier for lossless compression IH Witten, Z Bray, M Mahoui, B Teahan Proceedings DCC'99 Data Compression Conference (Cat. No. PR00096), 198-207, 1999 | 98 | 1999 |
Universal text preprocessing for data compression J Abel, W Teahan IEEE Transactions on Computers 54 (5), 497-507, 2005 | 73 | 2005 |
Models of English text WJ Teahan, JG Cleary Proceedings DCC'97. Data Compression Conference, 12-21, 1997 | 62 | 1997 |
Probability estimation for PPM WJ Teahan the NZ Comp. Sci. Research Students' Conf., 1995, 1995 | 58 | 1995 |
Enhancing the stability of organic photovoltaics through machine learning TW David, H Anizelli, TJ Jacobsson, C Gray, W Teahan, J Kettle Nano Energy 78, 105342, 2020 | 51 | 2020 |
Artificial Intelligence–Agents and Environments WJ Teahan Bookboon, 2010 | 48 | 2010 |
Storyboarding for visual analytics R Walker, L Ap Cenydd, S Pop, HC Miles, CJ Hughes, WJ Teahan, ... Information Visualization 14 (1), 27-50, 2015 | 42 | 2015 |
Correcting English text using PPM models WJ Teahan, S Inglis, JG Cleary, G Holmes Proceedings DCC'98 Data Compression Conference (Cat. No. 98TB100225), 289-298, 1998 | 42 | 1998 |
Peer-to-Peer Protocols for Resource Discovery in the Grid. NA Al-Dmour, WJ Teahan Parallel and Distributed Computing and Networks, 319-324, 2005 | 39 | 2005 |
Using language models for generic entity extraction IH Witten, Z Bray, M Mahoui, WJ Teahan Proceedings of the ICML Workshop on Text Mining, 14, 1999 | 36 | 1999 |
Experiments on the zero frequency problem JG Cleary, WJ Teahan Proc. Data Compression Conference 480, 1995 | 35 | 1995 |
Artificial Intelligence–Agent Behaviour WJ Teahan bookboon, 2010 | 33 | 2010 |
Parcop: A decentralized peer-to-peer computing system NA Al-Dmour, WJ Teahan Third International Symposium on Parallel and Distributed Computing/Third …, 2004 | 30 | 2004 |