Skip to content

EthanBlackburn/PartsOfSpeech_Tagger

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PartsOfSpeech_Tagger

This is a parts of speech tagger written in python. It implements a hidden markov model and the viterbi algorithm. I recommend testing and training on the 'development.txt' and 'training.txt' files. Using any other files will require you to edit the code(only slightly). I was able to achieve ~95% accuracy doing this. The accuracy really depends on how large your corpus is. I wasn't able to get my hands on the Penn treebank corpus, but have read that it is the best for POS tagging.

About

parts of speech tagger using a HMM and the viterbi algorithm

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages