Dr. Ben Medlock

BSc (Dunelm), MPhil (Cantab), PhD (Cantab)


Research Interests

  • Computer Science
  • Natural Language Processing
  • Applied Machine Learning

Projects and Papers

    • Investigating Classification for Natural Language Processing Tasks, VDM 2008 (Amazon)
    • Exploring Hedge Identification in Biomedical Literature, JBMI 2008 (Elsevier)
    • Weakly Supervised Learning for Hedge Classification in Scientific Literature, ACL 2007 (pdf)
    • An Adaptive Language Model Approach to Spam Filtering on a New Corpus (GenSpam), CEAS 2006 (pdf)
    • An Introduction to NLP-based Textual Anonymisation, LREC 2006 (pdf)
    • Paper on LM-based Spam Filtering (LingSpam) - 2003 (pdf)
    • Master's Thesis - A Generative, Adaptive LM Approach to Spam Filtering (pdf)
    • Bachelor's Thesis - A Tool for GLR Parsing in Haskell (pdf)

Corpora

    • Download the GenSpam email corpus here (download page).
    • Download the ITAC anonymisation corpus here (download page).
    • Download the Hedge Classification dataset here (download page).