Geeks With Blogs
Chris Breisch   .NET Data Practices

Ok, I admit it.  I don’t know when I’d be likely to do a project that needs this, but it is very cool.

OpenNLP is both the name of a group of open source projects related to natural language processing (NLP), and the name of a library of NLP tools written in Java by Jason Baldridge, Tom Morton, and Gann Bierner. My C# port is based upon the latest version (1.2.0) of the Java OpenNLP tools, released in April 2005. Development of the Java library is ongoing, and I hope to update the C# port as new developments occur.

Tools included in the C# port are: a sentence splitter, a tokenizer, a part-of-speech tagger, a chunker (used to "find non-recursive syntactic annotations such as noun phrase chunks"), a parser, and a name finder. The Java library also includes a tool for co-reference resolution, but the code for this feature is in flux and has not yet been ported to C#. All of these tools are driven by maximum entropy models processed by the SharpEntropy library.
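
To give a rough feel for what "driven by maximum entropy models" means, here is a toy sketch in Python. It is not the SharpEntropy or OpenNLP API. The feature names, outcome labels, and weights are all made up for illustration; a real model learns its weights from annotated text. The idea is that each tool turns its context into a set of active features, sums a weight per feature and outcome, and normalizes the exponentiated sums into probabilities.

```python
import math

# Hypothetical feature weights for a toy sentence-boundary decision.
# These numbers are invented for illustration; real models are trained.
WEIGHTS = {
    ("next_is_capitalized", "boundary"): 1.2,
    ("next_is_capitalized", "no_boundary"): -0.3,
    ("prev_is_abbreviation", "boundary"): -1.5,
    ("prev_is_abbreviation", "no_boundary"): 0.9,
}
OUTCOMES = ("boundary", "no_boundary")


def maxent_probabilities(active_features):
    """Return p(outcome | features): exponentiated weight sums, normalized to 1."""
    scores = {
        outcome: math.exp(
            sum(WEIGHTS.get((feature, outcome), 0.0) for feature in active_features)
        )
        for outcome in OUTCOMES
    }
    total = sum(scores.values())
    return {outcome: score / total for outcome, score in scores.items()}


def best_outcome(active_features):
    """Pick the outcome with the highest probability."""
    probs = maxent_probabilities(active_features)
    return max(probs, key=probs.get)


if __name__ == "__main__":
    # A period followed by a capitalized word, where the word before the
    # period looks like an abbreviation (for example "Dr.").
    features = ["next_is_capitalized", "prev_is_abbreviation"]
    print(maxent_probabilities(features))
    print(best_outcome(features))
```

Under this assumption, a sentence splitter, tokenizer, or tagger would differ mainly in which features it extracts and which outcomes it chooses between, while the scoring step stays the same.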


Posted on Thursday, May 11, 2006 5:03 AM | .NET Development

Comments on this post: Statistical parsing of English sentences

# re: Statistical parsing of English sentences
I'd really like to see that code! I'm especially interested in the chunker and name finder.
Left by Stephen on Jul 07, 2006 10:21 AM

# re: Statistical parsing of English sentences
I'm kind of doing my final-year project using this, but I can't get it to run.
I downloaded all the files.
Can you please tell me how to go about running the application?

Left by anand on Jan 11, 2009 8:14 AM


Copyright © Chris J. Breisch