This is the code repository for Natural Language Processing with Java Second Edition, published by Packt.
Techniques for building machine learning and neural network models for NLP
Natural Language Processing (NLP) allows you to take any sentence and identify patterns, special names, company names, and more. The second edition of Natural Language Processing with Java teaches you how to perform language analysis with the help of Java libraries, while constantly gaining insights from the outcomes.
This book covers the following exciting features:
- Understand basic NLP tasks and how they relate to one another
- Discover and use the available tokenization engines
- Apply search techniques to find people, as well as things, within a document
- Construct solutions to identify parts of speech within sentences
- Use parsers to extract relationships between elements of a document
If you feel this book is for you, get your copy today!
All of the code is organized into folders. For example, Chapter02.
The code will look like the following:
System.out.println(tagger.tagString("AFAIK she H8 cth!"));
System.out.println(tagger.tagString(
"BTW had a GR8 tym at the party BBIAM."));
Following is what you need for this book: Natural Language Processing with Java is for you if you are a data analyst, data scientist, or machine learning engineer who wants to extract information from a language using Java. Knowledge of Java programming is needed, while a basic understanding of statistics will be useful but not mandatory.
With the following software and hardware list you can run all code files present in the book (Chapter 1-12).
Chapter | Software required | OS required |
---|---|---|
1 | OpenNLP | Windows, Mac OS X, and Linux (Any) |
Stanford CoreNLP | ||
LingPipe | ||
Standford Tagger | ||
2 | OpenNLP Models | Windows, Mac OS X, and Linux (Any) |
3 | LingPipe Models | Windows, Mac OS X, and Linux (Any) |
4 | OpenNLPModels | Windows, Mac OS X, and Linux (Any) |
5 | Gate Twitter Model | |
LingPipe POS Models | Windows, Mac OS X, and Linux (Any) | |
6 | Stanford Classifier | Windows, Mac OS X, and Linux (Any) |
8-12 | Boilerpipe | |
POI | ||
PDFBox | Windows, Mac OS X, and Linux (Any) |
We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.
Richard M. Reese has worked in both industry and academia. For 17 years, he worked in the telephone and aerospace industries, serving in several capacities, including research and development, software development, supervision, and training. He currently teaches at Tarleton State University. Richard has written several Java books and a C Pointer book. He uses a concise and easy-to-follow approach to teaching about topics. His Java books have addressed EJB 3.1, updates to Java 7 and 8, certification, functional programming, jMonkeyEngine, and natural language processing.
AshishSingh Bhatia is a learner, reader, seeker, and developer at core. He has over 10 years of IT experience in different domains, including banking, ERP, and education. He is persistently passionate about Python, Java, R, and web and mobile development. He is always ready to explore new technologies.
Click here if you have any feedback or suggestions.