Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Any good materials to read? I work in a text-analytics startup, so I do have a basic understanding about NLP and related fields.


Good materials to read? I work in a text-analytics startup, so I do have a basic understanding about NLP and related fields.

I think have a solid ML understanding can serve you well if you are doing NLP.

* Good Freely Available Textbooks on Machine Learning? http://metaoptimize.com/qa/questions/186/

Besides that, I guess it depends specifically on what NLP tasks you are looking at. Can you tell me more?


Thanks! We primarily work on sentiment analysis, anaphora resolution, topic identification. Also work on topic classification and clustering.


If you do anaphora resolution you should maybe look at this recent paper: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.154...

it's simple enough to get running, fast, and outperforms the state of the art of a couple of years ago.


Also, any good materials for those of us who have no background in NLP besides basic computer science theory?


Good materials for those of us who have no background in NLP besides basic computer science theory?

My main proposal is to post a "How do I build X" question on MetaOptimize. (I don't know any place on the web where experienced NLP people chat, besides MetaOptimize, but I would be interested to hear other options.) NLP isn't really very hard once you know what you're doing, it's just that as an inexperienced person you will spend a lot of time going down dead ends and blind alleys, and there are a lot of pitfalls to avoid. So getting the advice of experienced people, right from the beginning, is the best way to do this.

See also:

* What are the best resources to use when starting machine learning for an experienced programmer? http://metaoptimize.com/qa/questions/334/

* How do I get started understanding statistics & data analysis? http://metaoptimize.com/qa/questions/154/

* New to data mining - where to start? http://metaoptimize.com/qa/questions/362/


Read the NLTK book available online for free here: http://nltk.org

The nltk tools them-selves might not always be state of the art (e.g. compare with the java libs from the NLP dept in Stanford) but at least you will get to know all the major concepts.

Other interesting book, although not completely finished yet: http://www.manning.com/ingersoll/ Taming Text by Grant Ingersol, Thomas Morton and Drew Farris involved in the Apache Lucene / Mahout and OpenNLP communities.


I am just a beginner into this field, but I will really like to venture out into NLP and ML, what are the best resources that I should start with?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: