
IBM Watson Developer Certification Study Guide

A one page study guide for the Watson Certification exam: C7020-230 - IBM Watson V3 Application Development

The certification exam covers all of the aspects of building an application that uses Watson services. This includes a basic understanding of cognitive technologies, as well as a practical knowledge of the core APIs.

Get up to speed with these resources:

- Cognitive Computing Primer

- Watson Developer Cloud API Documentation

The aim of this doc is to provide a consolidated view of the required reading and study that is outlined in the IBM Watson Professional Certification Program Study Guide Series.

 

The Watson services are constantly evolving, so always refer back to the Watson Documentation. Please also feel free to contribute or provide feedback if you see anything that is incorrect. Watson is accessed through IBM Bluemix.

### Check out and play with Watson services on Bluemix

 

High-level Exam Objectives

 

 

Section 1 - Fundamentals of Cognitive Computing

1.1. Define the main characteristics of a cognitive system.

  • Cognitive systems understand, reason and learn

    • Must understand structured and unstructured data

    • Must reason by prioritizing recommendations and have the ability to form hypotheses

    • Learn iteratively by repeated training to build better models

  • Cognitive systems are here to augment human knowledge not replace it

  • Cognitive systems use natural language processing to interact naturally with humans

  • Cognitive systems employ machine learning technologies

    • Supervised learning versus unsupervised learning

      • Supervised learning is a form of machine learning where humans iteratively train a machine learning model, such as a neural network, using a data set with known answers. Each iteration produces more accurate results until confidence is established that new information (of the same type) can be presented and classified with reasonable accuracy. Supervised learning relies on error reduction using gradient-minimization mathematics. Supervised learning is appropriate where we have reasonably large data sets with known results and a good idea of which dimensions of the input data drive these results. Neural networks trained on labelled data are examples of supervised machine learning.

        • Classification problem - aim to classify data into discrete classes based on input, e.g. given a list of symptoms, what diseases could the patient have

        • Regression problem - aim to produce an answer on a continuous range based on inputs e.g. predict an exam result given previous results

      • Unsupervised learning is a form of machine learning that uses statistical techniques to cluster input information into different groupings and does not rely on training with known data sets. Unsupervised learning searches for relationships and commonalities between inputs. Clustering techniques such as k-means are examples of unsupervised learning. Unsupervised learning can be appropriate where we have little or no data with known results and/or we don't understand which dimensions of the input data drive the outcomes (a minimal sketch contrasting supervised and unsupervised learning appears after this list).

      • Semi-supervised learning is a hybrid approach using both supervised learning and unsupervised learning techniques and can be useful where we have only a small data set with known results
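To make the supervised/unsupervised distinction concrete, here is a minimal Python sketch (the data and values are made up and not tied to any Watson service): a least-squares regression fitted to labelled exam data, followed by a plain k-means clustering of unlabelled points.

import numpy as np

# Supervised: regression on labelled data (hours studied -> exam score)
hours = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
scores = np.array([52.0, 57.0, 66.0, 70.0, 78.0])   # known answers ("ground truth")

# "Training" step: fit score = w * hours + b by least squares
A = np.vstack([hours, np.ones_like(hours)]).T
w, b = np.linalg.lstsq(A, scores, rcond=None)[0]
print("predicted score for 6 hours of study:", w * 6 + b)

# Unsupervised: clustering unlabelled points (no known answers, just structure)
points = np.array([[1.0, 1.2], [0.8, 1.1], [1.1, 0.9],
                   [5.0, 5.2], [5.3, 4.9], [4.8, 5.1]])
centroids = points[[0, 3]].astype(float)             # initial guesses (just two of the points)
for _ in range(10):                                  # plain k-means with k = 2
    dists = np.linalg.norm(points[:, None] - centroids[None, :], axis=2)
    labels = dists.argmin(axis=1)
    centroids = np.array([points[labels == k].mean(axis=0) for k in range(2)])
print("cluster assignments:", labels)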

 

1.2 Explain neural nets.

See https://github.com/cazala/synaptic/wiki/Neural-Networks-101 for further details

 

1.2.1 Neural Nets mimic how neurons in the brain communicate.

 

Neurons are the basic unit of a neural network. In nature, neurons have a number of dendrites (inputs), a cell nucleus (processor) and an axon (output). When the neuron activates, it accumulates all its incoming inputs, and if the total goes over a certain threshold it fires a signal through the axon (more or less). The important thing about neurons is that they can learn.

 

Artificial neurons mimic this model.

[Image: an artificial neuron]

Video: https://www.youtube.com/watch?v=bxe2T-V8XRs Neural Networks Demystified - Part 1: Data and Architecture

 

So how does a Neural Network learn? A neural network learns by training. The algorithm used to do this is called back-propagation. After giving the network an input, it will produce an output; the next step is to teach the network what the correct output should have been for that input (the ideal output). Starting from the output layer and working back, we adjust the weights across the synapses and neurons to reduce the output error, using a mathematical technique called gradient descent. The next time we show that same input to the network, it will give an output closer to the ideal one that we trained it to produce. This process is repeated for many iterations until we consider the error between the ideal output and the one produced by the network to be acceptable.

 

1.2.1.1. Explain the role of synapse and neuron

  • A neuron operates by receiving signals from other neurons through connections called synapses. The ML model performs a calculation on each neuron before passing the neuron output to the next layer of neurons via the synapses.

1.2.1.2. Understand weights and bias

  • For each neuron input there is a weight (the weight that the ML model assigns to that specific connection).

  • When a neuron activates, it computes its state by adding all the incoming inputs, each multiplied by its corresponding connection weight.

  • In addition, neurons always have one extra input, the bias, which is always 1 and has its own connection weight. This ensures that even when all other inputs are zero there can still be some activation in the neuron.

  • It then applies a function such as the sigmoid function or hyperbolic tangent to generate an output which is passed to the next layer of neurons via the synapses.

  • Training using back-propagation iteratively adjusts the weights to reduce the output error to acceptable levels (see the sketch of the neuron computation below)
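A minimal Python sketch of this neuron computation (the numbers are arbitrary and purely illustrative):

import numpy as np

def neuron_output(inputs, weights, bias_weight):
    # weighted sum of the inputs plus the bias (a constant input of 1), squashed by a sigmoid
    activation = np.dot(inputs, weights) + 1.0 * bias_weight
    return 1.0 / (1.0 + np.exp(-activation))

inputs = np.array([0.5, 0.1, 0.9])        # outputs from the previous layer, arriving via synapses
weights = np.array([0.4, -0.6, 0.2])      # one weight per synapse
bias_weight = 0.1                         # weight on the always-on bias input
print(neuron_output(inputs, weights, bias_weight))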

 

[Image: weights and bias]

 

1.2.1.3. List various approaches to neural nets

Link to Wikipedia article => Types of artificial neural networks

 

There are many types of NN. They can be formed from hardware or software models and can use a variety of topologies and learning algorithms. A basic classification might be:

 

Feed-forward - information moves through the NN from the input layer to the output layer with no feedback or looping. The behaviour of the network is shaped both by training the weights and by design choices such as the number of layers, the number of neurons in each layer and the number of connections between neurons.

Radial Basis Function - networks that use radial basis functions (whose output depends on the distance from a centre point) as activation functions, typically with a single hidden layer.

Recurrent Neural Networks - networks in which connections form cycles, so outputs can feed back into the network, giving it a form of internal memory. They are trained with variants of back-propagation (back-propagation through time) that adjust synapse and neuron weightings, working from the output layer back towards the input layer.

Modular Neural Networks - again mimicking biology, we connect the output from one NN to the input of another to form networks of NNs

Physical Neural Networks - these use physical components with variable properties such as electrical resistance

 

1.2.1.4. Explain forward and backward propagation

Feed Forward Propagation

A feed-forward neural network is an artificial neural network wherein connections between the units do not form a cycle. As such, it is different from recurrent neural networks. In this network, the information moves in only one direction, forward, from the input nodes, through the hidden nodes (if any) and to the output nodes. There are no cycles or loops in the network.

Video: https://www.youtube.com/watch?v=UJwK6jAStmg Neural Networks Demystified Part 2: Forward Propagation

 

Back Propagation

Backpropagation, an abbreviation for "backward propagation of errors", is a common method of training artificial neural networks used in conjunction with an optimization method such as gradient descent. It calculates the gradient of a cost function with respect to all the weights in the network, so that the gradient is fed to the optimization method which in turn uses it to update the weights, in an attempt to minimize the cost function.

Backpropagation requires a known, desired output for each input value in order to calculate the cost function gradient – it is therefore usually considered to be a supervised learning method; nonetheless, it is also used in some unsupervised networks such as autoencoders. It is a generalization of the delta rule to multi-layered feedforward networks, made possible by using the chain rule (calculus) to iteratively compute gradients for each layer. Backpropagation requires that the activation function used by the artificial neurons (or "nodes") be differentiable e.g. sigmoid or hyperbolic tangent functions.

How does backpropagation work? The algorithm adjusts the weights using a gradient descent calculation.

 

Video: https://www.youtube.com/watch?v=GlcnxUlrtek Neural Networks Demystified - Part 4: Backpropagation

 

1.2.1.5 Gradient Descent

Video: https://www.youtube.com/watch?v=5u0jaA3qAGk Neural Networks Demystified - Part 3: Gradient Descent

 

Gradient descent is a mathematical technique (based on differential calculus) used to calculate the weightings that result in the minimum error.

 

Let's say we plot the relationship between a certain weight and the error in the network's output:

 

[Image: error plotted against weight]

This algorithm calculates the gradient of the curve (dE/dW) at a given point. A positive gradient indicates that the error increases as the weight increases; a negative gradient indicates that the error decreases. We move the weight downhill, in the direction opposite to the gradient, until the gradient reaches (approximately) zero. This is the point of minimum error, and we use the weight at this point. The calculation is performed across multiple dimensions to give an array of optimized weights that together produce the smallest error.
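A minimal Python illustration of the idea, assuming a toy error function E(w) = (w - 3)^2 whose minimum is at w = 3:

# dE/dw for the toy error function E(w) = (w - 3)**2
def dE_dw(w):
    return 2.0 * (w - 3.0)

w = 0.0                 # arbitrary starting weight
learning_rate = 0.1
for step in range(100):
    w -= learning_rate * dE_dw(w)   # step against the gradient (downhill)

print(w)   # converges towards 3.0, the weight with the minimum error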

 

The goal of any supervised learning algorithm is to find a function that best maps a set of inputs to its correct output. An example would be a classification task, where the input is an image of an animal, and the correct output would be the name of the animal. The goal and motivation for developing the back-propagation algorithm was to find a way to train a multi-layered neural network such that it can learn the appropriate internal representations to allow it to learn any arbitrary mapping of input to output.

 

 

1.3 Explain machine learning technologies

(supervised, unsupervised, reinforcement learning approaches).

1.3.1. Explain the connection between Machine learning and Cognitive systems

Reference: [Computing, cognition and the future of knowing]

http://www.research.ibm.com/software/IBMResearch/multimedia/Computing_Cognition_WhitePaper.pdf

 

Machine learning is a branch of the larger discipline of Artificial Intelligence, which involves the design and construction of computer applications or systems that are able to learn based on their data inputs and/or outputs. The discipline of machine learning also incorporates other data analysis disciplines, ranging from predictive analytics and data mining to pattern recognition. A variety of specific algorithms are used for this purpose, frequently organized in taxonomies; the choice of algorithm depends on the type of input and output required.

Many products and services that we use every day from search-engine advertising applications to facial recognition on social media sites to “smart” cars, phones and electric grids are beginning to demonstrate aspects of Artificial Intelligence and Machine Learning. Most consist of purpose-built, narrowly focused applications, specific to a particular service. They use a few of the core capabilities of cognitive computing. Some use text mining. Others use image recognition with machine learning. Most ML applications are limited to the application for which they were conceived and are therefore narrow in their application domains.

 

Cognitive systems, in contrast, combine five core capabilities that are broadly applicable across domains allowing them to reason with purpose, learn and interact naturally with humans.

 

1. They create deeper human engagement by finding what really matters to their users

Cognitive systems create more fully human interactions with people —based on the mode, form and quality each person prefers. They take advantage of the data available today to create a fine-grained picture of individuals —such as geolocation data, web interactions, transaction history, loyalty program patterns, electronic medical records and data from wearables — and add to that picture details that have been difficult or impossible to detect: tone, sentiment, emotional state, environmental conditions and the strength and nature of a person’s relationships. They reason through the sum total of all this structured and unstructured data to find what really matters in engaging a person. By continuously learning, these engagements deliver greater and greater value, and become more natural, anticipatory and emotionally appropriate.

 

2. They scale and elevate expertise to make top class expertise available at scale, to the masses

Every industry’s and profession’s knowledge is expanding at a rate faster than any professional can keep up with —journals, new protocols, new legislation, new practices and entirely new fields. A clear example is found in healthcare, where it is estimated that in 1950, it took 50 years to double the world’s medical knowledge; by 1980, seven years; and in 2015, less than three years. Meanwhile, each person will generate one million gigabytes of health-related data in his or her lifetime, the equivalent of about 300 million books. Cognitive systems are designed to help organizations keep pace, serving as a companion for professionals to enhance their performance. Because these systems master the language of professions —the language of medicine, or sales, or cuisine —they can both understand and teach complex expertise. This reduces the time required for a professional to become an expert. In addition, because these systems are taught by leading practitioners —whether in customer service, oncology diagnosis, case law or any other field —they make available to broad populations the know-how of the best experts in the field.

 

3. They infuse products and services with cognition leading to continuous improvement of services and products

Cognition enables new classes of products and services to sense, reason and learn about their users and the world around them. This allows for continuous improvement and adaptation, and for augmentation of their capabilities to deliver uses not previously imagined. We see this happening already with cars, medical devices, appliances and even toys. The Internet of Things is dramatically expanding the universe of digital products and services —and where code and data go, cognition can now follow.

 

4. They enable cognitive processes and operations within organisations leading to cost reduction, improved customer experience and profit

Cognition also transforms how a company operates. Business processes infused with cognitive capabilities capitalize on the phenomenon of data, from internal and external sources. This gives them heightened awareness of workflows, context and environment, leading to continuous learning, better forecasting and increased operational effectiveness —along with decision-making at the speed of today’s data. This is good news in a world where, for example, an average billion-dollar company spends almost 1,000 person hours per week managing its suppliers.

 

5. They enhance exploration and discovery allowing organisations to learn, evolve and achieve competitive advantage

Ultimately, the most powerful tool that cognitive businesses will possess is far better “headlights” into an increasingly volatile and complex future. Such headlights are becoming more important as leaders in all industries are compelled to place big bets —on drug development, on complex financial modeling, on materials science innovation, on launching a startup. By applying cognitive technologies to vast amounts of data, leaders can uncover patterns, opportunities and actionable hypotheses that would be virtually impossible to discover using traditional research or programmable systems alone.

 

Large-scale machine learning is the process by which cognitive systems improve with training and use.

 

Cognitive computing is not a single discipline of computer science. It is the combination of multiple academic fields, from hardware architecture to algorithmic strategy to process design to industry expertise.

Many of these capabilities require specialized infrastructure that leverages high-performance computing, specialized hardware architectures and even new computing paradigms. But these technologies must be developed in concert, with hardware, software, cloud platforms and applications that are built expressly to work together in support of cognitive solutions.

 

1.3.2. Describe some of the main machine learning concepts:

Some useful links

 

  • [A Tour of Machine Learning Algorithms ]

    http://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/

     

    There are loads of ML algorithms, so it is useful to categorize them by:

    • Classification by Learning style

      • Supervised Learning

        • The model is fine-tuned by a human using a well-understood training dataset to minimize the output errors (this process is often referred to as training)

        • Classification and Regression type problems

        • Examples of supervised learning algorithms are back-propagation and logistic regression

      • Unsupervised

        • We don’t have a well understood data set with known results and we need to find relationships in the data using statistical models

        • Example problems are clustering, dimensionality reduction and association rule learning

        • Example algorithms include: the Apriori algorithm and k-Means.

      • Semi-supervised

        • Small amount of labelled input data

        • Do a bit of supervised learning and augment with unsupervised approaches

        • Example problems are classification and regression.

    • Classification by how they work is more complex (see mind map), but some examples are:

      • Neural Networks, which as we know, mimic biological systems

      • Regression algorithms model relations between variables to minimize error

      • Instance Based Algorithms compare new data to a well understood data set and make predictions based on similarity

      • and there are loads more...

 

 

https://en.wikipedia.org/wiki/Outline_of_machine_learning

 

There is too much info here to be useful in full. It is interesting to note that IBM is not listed as providing ML tools or frameworks. Perhaps this is symptomatic of IBM pitching Watson as cognitive computing rather than ML?

 

 

1.3.2.1. Supervised learning:

We are given a data set and already know what our correct output should look like, having an idea of the relationship between the input and output, e.g. study hours drive exam success. We iteratively train the system using a sample of our data until our results are satisfactory, then we test using some data we held back. If all is well the system should be able to process new input data that we introduce to it. Typically supervised learning is used to solve two types of problem:

1.3.2.1.1.Classification
  • In a classification problem we are trying to predict results in a discrete output. In other words we are trying to map input variables into categories. For example we could be predicting grades of exam pass A, B, C etc.

  • Many of the Watson services provide classification of input data according to pre-trained models

    • NLU classifies the sentiment expressed in a piece of text, for example
1.3.2.1.2.Regression/Prediction
  • In a regression problem we are trying to predict results with a continuous output meaning that we are trying to map input variables to some continuous function. For example we could be predicting an exam score % based on hours studied.
1.3.2.1.3.Semi-supervised learning
  • Semi-supervised learning covers tasks and techniques that also make use of unlabeled data for training, typically a small amount of data with labelled responses combined with a large amount of unlabeled data.
1.3.2.2. Unsupervised learning:
  • Unsupervised learning is a type of machine learning algorithm used to draw inferences from datasets consisting of input data without labeled responses. Input data is not labeled and does not have a known result. This is appropriate where we do not understand which dimensions of the input data drive the outputs we are interested in.
1.3.2.2.1.Artificial neural network
  • An Artificial Neural Network (ANN) is an information processing paradigm that is inspired by the way biological nervous systems, such as the brain, process information.
1.3.2.2.2.Association rule learning
  • Association rule learning is a rule-based machine learning method for discovering interesting relations between variables in large databases. It is intended to identify strong rules discovered in databases using some measures of interestingness.
1.3.2.2.3.Hierarchical clustering
  • Hierarchical clustering is a method of cluster analysis which seeks to build a hierarchy of clusters. Strategies for hierarchical clustering generally fall into two types:

    • Agglomerative: This is a "bottom up" approach: each observation starts in its own cluster, and pairs of clusters are merged as one moves up the hierarchy.

    • Divisive: This is a "top down" approach: all observations start in one cluster, and splits are performed recursively as one moves down the hierarchy.

1.3.2.2.4.Cluster analysis
  • Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense or another) to each other than to those in other groups (clusters).
1.3.2.2.5.Outlier Detection
  • The local outlier factor is based on a concept of local density, where locality is given by the k nearest neighbors, whose distances are used to estimate the density. By comparing the local density of an object to the local densities of its neighbors, one can identify regions of similar density, and points that have a substantially lower density than their neighbors. These are considered to be outliers.
1.3.2.3. Reinforcement learning
  • Reinforcement learning is an approach in which an agent learns by trial and error: it takes actions in an environment and adjusts its behaviour to maximise a cumulative reward signal, rather than learning from labelled examples.

1.4. Define a common set of use cases for cognitive systems.

- Customer Call Centers

  • Agent Assist Q&A System

    - Problems Solved: Provide a natural language help system so human call agents can rapidly retrieve answers to customer questions

    - Services used:

    - Conversation

    - Natural Language Understanding

    - Retrieve & Rank

- Automation of Customer/Technical Support Ticket Routing

- Problems Solved:

  1. Detect the topic of a ticket and route it to the appropriate department to handle it

  2. Escalate support tickets based on customer sentiment

  3. Route support requests to agents that have already solved similar problems by detecting natural language similarities between new customer tickets and resolved ones

- Services used could include:

  - Natural Language Classifier to identify the class of problem/question using a bespoke classifier developed in WKS

  - Tone Analyser to extract the sentiment expressed in the support ticket

  - Natural Language Understanding could pull out entities and keywords, and the app could use these to look up agents with recent experience in these areas from a database

 

- Physicians Expert Adviser:

- Example: Watson Discovery Advisor

- Problem Solved: Provides relevant medical suggestions and insights in natural language so physicians can more accurately diagnose patients.

- Services used:

  - Conversation can be used to provide a natural language interface to the doctor

  - Natural Language Understanding can pull out entities and keywords based on a domain-specific model built in WKS

  - Retrieve and Rank can search for and present the most relevant documents based on the output from NLU

 

- Social Media - Data Insights:

  - Partner: Ground Signal

  - Problem Solved: Extract useful insights from social media such as Instagram and Twitter by determining the content of photos and the topics/sentiment of user posts.

  - Services used: keyword, entity, and sentiment/tone analysis

 

1.5. Define Precision, Recall, and Accuracy.

Use the awful cancer example to remember how this works.

 

Some patients have cancer and are diagnosed with cancer (True Positives)

Some patients have cancer and are not diagnosed with cancer (False Negatives)

Some patients do not have cancer and are diagnosed with cancer (False Positives)

Some patients do not have cancer and are not diagnosed with cancer (True Negatives)

 

 

1.5.1. Precision:

  • Definition: Precision is the percentage of documents labelled as positive that are actually positive.

  • Formula: True Positives/(True Positives + False Positives)

  • This is the proportion of patients diagnosed with cancer that really do have cancer. In Physics precision is the closeness of measurements to each other. Precision here indicates the likelihood that a diagnosis of cancer is correct.

1.5.2. Recall:

  • Recall is the percentage of the actually-positive documents that were successfully labelled as positive (retrieved).

  • Formula: True Positives/(True Positives + False Negatives)

  • This is the proportion of patients who really do have cancer that we diagnose. It's a measure of our effectiveness at diagnosing.

1.5.3. Accuracy:

  • Accuracy is the fraction of all documents (cases) that were classified correctly.

  • Formula: (True Positives + True Negatives)/Total Document Count

  • This is the proportion of the total patients we correctly diagnosed. It indicates how close we are to reality. In Physics accuracy is how close we get to the real answer in our experiment.

 

In basketball an accurate player will manage to place the ball in the hoop. A precise player will always hit the same spot. We want to be both accurate and precise.
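A quick worked Python example of the three formulas, using made-up counts for the cancer example:

# Made-up confusion-matrix counts, purely for illustration
tp = 80    # have cancer, diagnosed with cancer (True Positives)
fn = 20    # have cancer, not diagnosed (False Negatives)
fp = 30    # no cancer, diagnosed with cancer (False Positives)
tn = 870   # no cancer, not diagnosed (True Negatives)

precision = tp / (tp + fp)                    # 80 / 110  ~= 0.73
recall = tp / (tp + fn)                       # 80 / 100   = 0.80
accuracy = (tp + tn) / (tp + fn + fp + tn)    # 950 / 1000 = 0.95

print(precision, recall, accuracy)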

 

1.5.4. Diagrams like this are often useful in capturing the True/False Positives/Negatives described above: https://www.quora.com/What-is-the-best-way-to-understand-the-terms-precision-and-recall

 

1.6. Explain the importance of separating training, validation and test data.

Normally to perform supervised learning you need at least two types of data sets:

  1. In one dataset (your "gold standard") you have the input data together with correct/expected output. This dataset is usually prepared either by humans or by collecting some data in semi-automated way (with WKS we can pre-annotate documents using Watson services trained on generic domains to bootstrap our annotation efforts). It is important that you have the expected output for every data row.

  2. The data you are going to apply your model to. In many cases this is the data for which you are interested in the output of your model, and thus you don't have any "expected" output here yet.

 

While performing machine learning you do the following:

  1. Training phase: you present your data from your "gold standard" and train your model, by pairing the input with expected output and minimising errors.

  2. Validation/Test phase: you estimate how well your model has been trained (this depends on the size of your data, the value you would like to predict, the inputs, etc.) and you estimate model properties (mean error for numeric predictors, classification errors for classifiers, recall and precision for IR models, etc.)

  3. Application phase: now you apply your freshly-developed model to the real-world data and get the results. Since you normally don't have any reference value in this type of data (otherwise, why would you need your model?), you can only speculate about the quality of your model output using the results of your validation phase.

 

The validation phase is often split into two parts:

  1. In the first part you just look at your competing models and select the best performing approach using the validation data (=validation)

  2. Then you estimate the accuracy of the selected approach (=test) using a small (25%) subset of the known dataset. See Section 1.7 below.

 

Hence the separation of data into training/validation/test datasets of 50/25/25%.

 

Note - In the case where you don't need to choose an appropriate model from several rival approaches, i.e. you don't need to validate, you can simply re-partition your data so that you have only a training dataset and a test dataset, without performing the validation of your trained model. I personally partition them 70/30 in that case.
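A minimal Python sketch of the 50/25/25 split on a toy labelled dataset (the data and exact proportions are illustrative):

import random

# Assume `records` is a list of (input, expected_output) pairs - our "gold standard"
records = [(x, x % 3) for x in range(1000)]   # toy labelled data

random.seed(42)
random.shuffle(records)

n = len(records)
train = records[:int(0.50 * n)]                     # 50% - used to train the model
validation = records[int(0.50 * n):int(0.75 * n)]   # 25% - used to choose between competing models
test = records[int(0.75 * n):]                      # 25% - used to estimate accuracy of the chosen model

print(len(train), len(validation), len(test))       # 500 250 250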

 

1.7. Measure accuracy of service.

The goal of the ML model is to learn patterns that generalize well for unseen data instead of just memorizing the data that it was shown during training. Once you have a model, it is important to check if your model is performing well on unseen examples that you have not used for training the model. To do this, you use the model to predict the answer on the evaluation dataset (held out data) and then compare the predicted target to the actual answer (ground truth).

A number of metrics are used in ML to measure the predictive accuracy of a model. The choice of accuracy metric depends on the ML task. It is important to review these metrics to decide if your model is performing well.

 

1.8. Perform Domain Adaption using Watson Knowledge Studio (WKS).

There is a great YouTube video series for Watson Knowledge Studio here:

Video: https://www.youtube.com/watch?v=XBwpU97D5aE Teach Watson with Watson Knowledge Studio

 

IBM Watson Knowledge Studio is a cloud-based application that enables developers and domain experts to collaborate on the creation of custom annotator components that can be used to identify mentions and relations in unstructured text. Watson Knowledge Studio is:

- Intuitive: Use a guided experience to teach Watson the nuances of natural language without writing a single line of code

- Collaborative: SMEs work together to infuse domain knowledge in cognitive applications

We can use Watson™ Knowledge Studio to create a machine-learning model that understands the linguistic nuances, meaning, and relationships specific to your industry.

To become a subject matter expert in a given industry or domain, Watson must be trained. You can facilitate the task of training Watson with Watson Knowledge Studio. It provides easy-to-use tools for annotating unstructured domain literature, and uses those annotations to create a custom machine-learning model that understands the language of the domain. The accuracy of the model improves through iterative testing, ultimately resulting in an algorithm that can learn from the patterns that it sees and recognize those patterns in large collections of new documents.

The following diagram illustrates how it works:

[Image: Watson Knowledge Studio workflow]

  1. Based on a set of domain-specific source documents, the team creates a type system that defines **entity types** and **relation types** for the information of interest to the application that will use the model, e.g. for car accident reports, manufacturers and models may be entity types and "manufactured by" could be a relation type.

  2. A group of two or more human annotators annotate a small set of source documents to label words that represent entity types, words that represent relation types between entity mentions, and to identify co-references of entity types. Any inconsistencies in annotation are resolved, and one set of optimally annotated documents is built, which forms the ground truth.

  3. The ground truth (well understood dataset) is used to train a model.

  4. The trained model is used to find entities, relations, and co-references in new, never-seen-before documents.

 

Integrations of WKS with other Watson Services

We can then deliver meaningful insights to users by deploying the model trained for a particular domain via other Watson cloud-based offerings and cognitive solutions such as Watson NLU or Watson Explorer.

 

We can also bootstrap annotation by using the AlchemyLanguage (now NLU) entity extraction service (trained on general-purpose domains) to automatically find and annotate entities in your documents as a quick start to building a custom classifier. When human annotators begin to annotate the documents, they can see the annotations that were already made by the service and can review and add to them. See Pre-annotating documents with IBM AlchemyLanguage for details.

 

We could also import industry-specific dictionaries that you downloaded from IBM® Bluemix® Analytics Exchange.

We can import analyzed documents that are in UIMA CAS XMI format. For example, you can import UIMA CAS XMI files that were exported from IBM Watson Explorer content analytics collections or IBM Watson Explorer Content Analytics Studio.

Deploy a trained model to use with the AlchemyLanguage service (now NLU).

Export a trained model to use in IBM Watson Explorer.

 

1.9. Define Intents and Classes.

  • The **Natural Language Classifier** service, available via WDC, enables clustering or classification based on some measure of inherent similarity or distance given the input data. Such groupings are known as intents or classes.

  • Whereas classes may include images, an intent is a similar grouping for written utterances in unstructured natural language format.

  • NLC is an example of supervised learning: it is trained on sample utterances that have been labelled with the correct classes

 

1.10. Explain difference between ground truth and corpus.

  • Ground truth is used in both supervised and unsupervised machine learning approaches, yet it takes different forms and formats in each.

  • For example, in a typical supervised learning system, ground truth consists of inputs (questions) and approved outputs (answers). With the aid of logistic regression and iterative training using the ground truth, the system improves in accuracy and is validated and tested (remember 50/25/25).

  • For NLC, the ground truth consists of a comma-separated CSV or a JSON file that lists hundreds of sample utterances and a dozen or so intents (or classes) classifying those utterances.

 

1.11. Define the difference between the user question and the user intent.

To answer correctly, we need to understand the intent behind the question, in order to first classify it and then take action on it (e.g., with a Dialog API).

  • The user question is the verbatim question.

  • The user intent maps the user question to a known classification.

  • This is a form of classifying questions based on search goals.

  • Intents are the superset of all actions your users may want your cognitive system to undertake. Put another way, questions are a subset of user intents. Questions usually end in "?", but sometimes we need to extract the user intent from the underlying context.

  • Common examples of user intents:

    • Automation: "Schedule a meeting with Sue at 5pm next Tuesday."

    • Declarative: "I need to change my password."

    • Imperative: "Show me the directions to the nearest gas station."

 

Section 2 - Use Cases of Cognitive Services

Watson services are constantly evolving e.g. AlchemyAPI is now defunct with its functionality provided via Watson NLU, NLC etc.

 

Today (21 April 2017) the following Watson services are available in the Bluemix UK region

 

 

  • Conversation

  • Discovery

  • Document Conversion

  • Language Translator

  • Natural Language Classifier

  • Natural Language Understanding

  • Personality Insights

  • Retrieve and Rank

  • Speech to Text

  • Text to Speech

  • Tone Analyser

  • Tradeoff Analytics (deprecated)

  • Visual Recognition

 

Note - Alchemy Language is defunct but is still within scope for the exam rather than NLU

 

Alchemy consisted of:

Alchemy Language

Alchemy Data News

 

2.1. Select appropriate combination of cognitive technologies based on use-case and data format.

 

 

2.1.1. Use case = Agent-assist for email-based customer call center

  • Data: customer emails

  • Services: Q&A, text classification, entity extraction, and keyword extraction

  • Watson-specific: NLC, R&R, Alchemy Language

2.1.2. Agent-assist for phone-based customer call center

  • Data: customer voice recordings

  • Services: Q&A, Speech recognition, text-to-speech, text classification, entity extraction, keyword extraction

  • Watson-specific: NLC, R&R, Alchemy Language

2.1.3. Expert advisor use case for physicians

  • Data: natural language intents

  • Services: Q&A, Text classification, entity extraction and keyword extraction

  • Watson-specific: NLC, R&R, Alchemy Language

2.1.4. Data insights for Instagram images

  • Data: images

  • Services: Image classification and natural OCR

  • Watson-specific: Visual Insights

2.1.5. Data insights for Twitter

  • Data: tweets

  • Services: Text classification, entity extraction, keyword extraction

  • Watson-specific: NLC and Alchemy Language

 

2.2. Explain the uses of the Watson services in the Application Starter Kits.

[You can view the list of Watson Starter Kits here] on the Watson Developer Cloud

https://www.ibm.com/watson/developercloud/starter-kits.html

 

The starter kits provide a functioning example app for specific use cases which can be used as a starting point by developers for their own cognitive apps. Source code is provided on GitHub Watson Developer Cloud repository.

 

https://github.com/watson-developer-cloud

 

  • Answer Retrieval

    • Find and surface the most relevant responses to natural language queries from large unstructured data sets. This sample application provides a generic approach to ingest documents & train a ranker using Retrieve & Rank. Additionally the starter kit shows developers how to improve the results of the machine learning algorithm through feature engineering.

    • Services used - Retrieve and Rank

  • News Intelligence

    • This sample application demonstrates how to query news content to understand what people are saying or feeling about important topics

    • Services used - Discovery

  • Social Customer Care

    • This sample application demonstrates how to gather personality and brand intelligence from tweets and responds with an appropriate next best action.

    • Services Used - NLC, Tone Analyser, Personality Insights

  • Text Message Chatbot

    • A "chat" application that forecasts the weather.

    • Services used - Conversation, Alchemy Language (deprecated)

  • Voice of the Customer

    • This starter kit provides an approach to implementing an application that uses the Discovery service to gain valuable insights from customer opinions. The source data for the application is Amazon reviews.

    • Services used - Discovery

  • Knowledgebase Search

    • This sample application provides a generic approach for how to improve the results of the Discovery Service's search by ingesting and enriching documents with natural language processing and querying with these enrichments.

    • Services Used - Discovery

 

There are also Gallery apps showcasing capability that also have source available on Git Hub.

  • Sentiment and Emotion

    • Uses text analytics to detect sentiment and emotions from people's digital footprints.

    • Services Used - Alchemy Language

  • Election Insights

    • Performs natural language processing on 75,000+ news sources as they are published and uses Alchemy's taxonomy breakdown to focus on all things election.

    • Services Used - Alchemy Data News

  • Investment Advisor

    • Combines IBM Watson Personality Insights and IBM Watson Tradeoff Analytics services to recommend suitable funds and agents for clients.

    • Services Used - Personality Insights, Tradeoff Analytics

  • Speech to Speech

    • Attempting to break language barriers, this app uses three Watson services to translate your speech instantly and speak the translation aloud in a foreign language.

    • Services Used - Speech to Text, Language Translator, Text to Speech

  • News Explorer

    • Automatically construct a news information network and present large volumes of news results in an understandable fashion.

    • Services Used - Alchemy Data News

  • Watson Rover

    • A Rover that autonomously navigates a maze by interpreting multi-lingual instructions and recognising visual cues

    • Services used - Visual Recognition, Speech to Text, Language Translator

  • Designer Match

    • Helps you find accessories by matching designer personalities

    • Services Used - Personality Insights

  • NYC School Finder

    • Find the best NYC school for your child.

    • Services Used - Personality Insights, Tradeoff Analytics

  • Nests

    • Find the best home to live in the US

    • Services Used - Tradeoff Analytics

  • SF Life

    • Find the best area to live in San Francisco based on lifestyle preferences.

    • Services Used - Tradeoff Analytics

  • Your Celebrity Match

    • Enter your Twitter handle to see which celebrities' personalities are most similar to yours!

    • Services Used - Personality Insights

 

2.3. Describe Watson Services.

For section 2.2 and 2.3, we deep dive into the Watson services currently available and stated in the study guide. By understanding the services individually, it will help with knowing what services would work for different scenarios.

 

Natural Language Classifier

 

The Natural Language Classifier service applies cognitive computing techniques to return the best matching classes for a sentence or phrase. For example, you submit a question and the service returns keys to the best matching answers or next actions for your application. Your application can then take an action based on this response and your use case. You create a classifier instance by providing a set of representative strings and a set of one or more correct classes for each training example. After training, the new classifier can accept new questions or phrases that it has not previously encountered and return the top matches with a probability value for each match.

Intended Use

The Natural Language Classifier is tuned and tailored to short text (1000 characters or less) and can be trained to function in any domain or application. For example:

  • Tackle common questions from your users that are typically handled by a live agent.

  • Classify SMS texts as personal, work, or promotional

  • Classify tweets into a set of classes, such as events, news, or opinions.

 

Based on the response from the service, an application can control the outcome to the user. For example, you can start another application, respond with an answer, begin a dialog, or any number of other possible outcomes.

Here are some other examples of how you might apply the Natural Language Classifier:

- Twitter, SMS, and other text messages

- Classify tweets into a set of classes, such as events, news, or opinions.

- Analyze text messages into categories, such as Personal, Work, or Promotions.

- Analyze text from social media or other sources and identify whether it relates positively or negatively to an offering or service (use along with Tone Analyser)

 

You input

Texts to train the model

  • input via API calls (a POST to the /classifiers endpoint)

  • uploaded as .csv file to the Beta toolkit

  • input directly into the Beta toolkit

Service output

Classes ordered by confidence, returned in JSON format. For example, here we have text classified against two possible classes, temperature and conditions:

 

{ "classifier_id": "004a12x110-nlc-3450",

"url": "https://gateway.watsonplatform.net/natural-language-classifier/api/v1/classifiers/004a12x110-nlc-3450", "text": "how warm is it",

"top_class": "temperature",

"classes": [

{

"class_name": "temperature",

"confidence": 0.9932755981713393

},

{ "class_name": "conditions",

"confidence": 0.0067244018286606744

}

]

}

 

Your app would tend to use the “top_class” returned in the JSON to perform some next action
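For example, a minimal Python sketch of application logic that acts on the top_class field (the branch bodies are hypothetical):

import json

# response_body is the JSON returned by the classify call shown above
response_body = '''{"top_class": "temperature",
                    "classes": [{"class_name": "temperature", "confidence": 0.9932755981713393},
                                {"class_name": "conditions", "confidence": 0.0067244018286606744}]}'''

result = json.loads(response_body)
if result["top_class"] == "temperature":
    answer = "It will be 22 degrees today."        # hypothetical next action
else:
    answer = "Expect light rain this afternoon."   # hypothetical next action
print(answer)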

 

Note - the service will attempt to classify text it has not previously encountered, such as "Should I wear a jumper today?" - this text is NOT part of the training data (https://watson-developer-cloud.github.io/doc-tutorial-downloads/natural-language-classifier/weather_data_train.csv), yet it is correctly classified as being temperature related with a confidence level of 79%, despite not mentioning anything obviously related to temperature.

 

**{ "classifier_id": "004a12x110-nlc-3450", "url": "https://gateway.watsonplatform.net/natural-**language-classifier/api/v1/classifiers/004a12x110-nlc-3450", "text": "should I wear a jumper today", "top_class": "temperature", "classes": [ { "class_name": "temperature", "confidence": 0.7855886480455422 }, { "class_name": "conditions", "confidence": 0.21441135195445774 } ] }

 

How to use the service

The process of creating and using the classifier:

[Image: the process of creating and using the classifier]

 

CSV training data file format

Make sure that your CSV training data adheres to the following format requirements:

- The data must be UTF-8 encoded.

- Separate text values and each class value by a comma delimiter. Each record (row) is terminated by an end-of-line character, which is a special character or sequence of characters that indicate the end of a line.

- Each record must have one text value and at least one class value.

- Class values cannot include tabs or end-of-line characters.

- Text values cannot contain tabs or new lines without special handling. To preserve tabs or new lines, escape a tab with \t, and escape new lines with \r, \n or \r\n. For example, Example text\twith a tab (using the escape sequence) is valid, but the same text containing a literal tab character is not valid.

- **Always enclose text or class values with double quotation marks in the training data when they include the following characters:**

Commas ("Example text, with comma"),

Double quotation marks. In addition, quotation marks must be escaped with double quotation marks ("Example text with ""quotation""").
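For illustration, a small training CSV that follows these rules might look like the rows below (these rows are made up in the style of the weather sample data referenced later, not copied from it). Note the double quotation marks around the text value that contains a comma:

"How hot is it outside today?",temperature
"Is it raining at the moment?",conditions
"Will I need a winter coat tomorrow?",temperature
"Is it windy enough to fly a kite?",conditions
"What is the temperature, in Celsius?",temperature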

 

Size limitations

There are size limitations to the training data:

- The training data must have at least **five records** (rows) and no more than 15,000 records.

- The maximum total length of a text value is 1024 characters.

 

Supported languages

The classifier supports English (en), Arabic (ar), French (fr), German (de), Japanese (ja), Italian (it), Portuguese (pt), and Spanish (es). The language of the training data must match the language of the text that you intend to classify. Specify the language when you create the classifier.

Guidelines for good training

The following guidelines are not enforced by the API. However, the classifier tends to perform better when the training data adheres to them:

- Limit the length of input text to fewer than 60 words.

- Limit the number of classes to several hundred classes. Support for larger numbers of classes might be included in later versions of the service.

- When each text record has only one class, **make sure that each class is matched with at least 5 - 10 records** to provide enough training on that class.

 

It can be difficult to decide whether to include multiple classes for a text. Two common reasons drive multiple classes:

- When the text is vague, identifying a single class is not always clear.

- When experts interpret the text in different ways, multiple classes support those interpretations.

However, if many texts in your training data include multiple classes, or if some texts have more than three classes, you might need to refine the classes. For example, review whether the classes are hierarchical. If they are hierarchical, include the leaf node as the class e.g. News/Reviews

 

Note - as at 21/4/17 the service provides a Beta version of an online toolkit. Presumably this offers the ability to load and train the classifier via the web UI rather than via API calls, meaning domain experts can train classifiers themselves. The web UI does not work well for adding classes and texts one at a time; it works better to load them from a .csv file.

 

API Calls

  1. Create the service instance and get your credentials

  2. Load your csv file to the service using the username and password obtained in step 1

curl -u "0dcfc43f-8ede-490b-b9f8-b8a9313d717d":"YqjQE4WoLosC" -F training_data=@"C:\Users\John\Desktop\IBM Watson Application Developer Certification\weather_data_train.csv" -F training_metadata="{\"language\":\"en\",\"name\":\"My Classifier\"}" "https://gateway.watsonplatform.net/natural-language-classifier/api/v1/classifiers"

 

Expect a response like this, indicating that the classifier is training:

{

"classifier_id" : "90e7acx197-nlc-45886",

"name" : "My Classifier",

"language" : "en",

"created" : "2017-04-21T14:37:50.133Z",

"url" : "https://gateway.watsonplatform.net/natural-language-classifier/api/v1/classifiers/90e7acx197-nlc-45886",

"status" : "Training",

"status_description" : "The classifier instance is in its training phase, not yet ready to accept classify requests"

}

 

  3. Periodically check the status of the new classifier with a GET call to the classifiers endpoint until it is ready (use the classifier ID returned above, not the name)

 

curl -u "0dcfc43f-8ede-490b-b9f8-b8a9313d717d":"YqjQE4WoLosC" "https://gateway.watsonplatform.net/natural-language-classifier/api/v1/classifiers/90e7acx197-nlc-45886"

 

{

"classifier_id" : "90e7acx197-nlc-45886",

"name" : "My Classifier",

"language" : "en",

"created" : "2017-04-21T14:37:50.133Z",

"url" : "https://gateway.watsonplatform.net/natural-language-classifier/api/v1/classifiers/90e7acx197-nlc-45886",

"status" : "Available",

"status_description" : "The classifier instance is now available and is ready to take classifier requests."

}

 

  4. Once the classifier is ready you can send it text for classification using a GET or POST call to the **classify** endpoint (note: unless stated otherwise, GET is assumed)

 

With GET, the text is passed in the URL as the text parameter

curl -u "0dcfc43f-8ede-490b-b9f8-b8a9313d717d":"YqjQE4WoLosC" "https://gateway.watsonplatform.net/natural-language-classifier/api/v1/classifiers/90e7acx197-nlc-45886/classify?text=How%20hot%20will%20it%20be%20today%3F"

 

With POST you don't need to replace blanks with %20 in the text, and you can present it as JSON

curl -X POST -u "0dcfc43f-8ede-490b-b9f8-b8a9313d717d":"YqjQE4WoLosC" -H "Content-Type:application/json" -d "{\"text\":\"How hot will it be today?\"}" "https://gateway.watsonplatform.net/natural-language-classifier/api/v1/classifiers/90e7acx197-nlc-45886/classify"

 

{

"classifier_id" : "90e7acx197-nlc-45886",

"url" : "https://gateway.watsonplatform.net/natural-language-classifier/api/v1/classifiers/90e7acx197-nlc-45886",

"text" : "How hot will it be today?",

"top_class" : "temperature",

"classes" : [ {

"class_name" : "temperature",

"confidence" : 0.9928373750021761

}, {

"class_name" : "conditions",

"confidence" : 0.007162624997823877

} ]

}

 

Note - by default IBM collects data from all calls to the service. To opt out, specify the X-Watson-Learning-Opt-Out header with the value true
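The same classify request can be issued from application code. Here is a minimal sketch using the Python requests library; the credentials and classifier ID are simply the placeholder values from the curl examples above, and the opt-out header from the note is included for illustration:

import requests

# Placeholder credentials and classifier ID taken from the curl examples above
USERNAME = "0dcfc43f-8ede-490b-b9f8-b8a9313d717d"
PASSWORD = "YqjQE4WoLosC"
CLASSIFIER_ID = "90e7acx197-nlc-45886"

url = ("https://gateway.watsonplatform.net/natural-language-classifier/api/v1/"
       "classifiers/{}/classify".format(CLASSIFIER_ID))

resp = requests.post(url,
                     auth=(USERNAME, PASSWORD),
                     headers={"X-Watson-Learning-Opt-Out": "true"},   # opt out of data collection
                     json={"text": "How hot will it be today?"})

result = resp.json()
print(result["top_class"])   # expected to print "temperature" for this text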

 

More detailed documentation for Natural Language Classifier

 

AlchemyLanguage

This service is deprecated. As of April 2017 it is no longer possible to create a new instance. Unfortunately it will still feature heavily in the test.

 

AlchemyLanguage is a collection of APIs that offer text analysis through natural language processing. The AlchemyLanguage APIs can analyze text and help you to understand its sentiment, keywords, entities, high-level concepts and more.

Intended Use

Use one or all of the natural language processing APIs available through AlchemyLanguage to add high-level semantic information.

Browse the documentation to learn more about each of AlchemyAPI's text analysis service functions:

- Entity Extraction

- Sentiment Analysis

- Emotion Analysis

- Keyword Extraction

- Concept Tagging

- Relation Extraction

- Taxonomy Classification

- Author Extraction

- Language Detection

- Text Extraction

- Microformats Parsing

- Feed Detection

- Linked Data Support

You input

Any publicly-accessible webpage or posted HTML/text document.

Service output

Extracted meta-data including, entities, sentiment, keywords, concepts, relations, authors, and more, returned in XML, JSON, and RDF formats

More detailed documentation for AlchemyLanguage

 

AlchemyData News

AlchemyData News provides news and blog content enriched with natural language processing to allow for highly targeted search and trend analysis. Now you can query the world's news sources and blogs like a database.

AlchemyData News indexes 250k to 300k English language news and blog articles every day with historical search available for the past 60 days. You can query the News API directly with no need to acquire, enrich and store the data yourself - enabling you to go beyond simple keyword-based searches.

Intended Use

Highly targeted search, time series and counts for trend analysis and pattern mapping, and historical access to news and blog content.

You input

Build a query with natural language processing to search both the text in indexed content and the concepts that are associated with it.

Service output

News and blog content enriched with our full suite of NLP services. Keywords, Entities, Concepts, Relations, Sentiment, Taxonomy

More detailed documentation for AlchemyData News

Tone Analyzer

The IBM Watson™ Tone Analyzer Service uses linguistic analysis to detect three types of tones from text: emotion, social tendencies, and language style. Emotions identified include things like anger, fear, joy, sadness, and disgust. Identified social tendencies include things from the Big Five personality traits used by some psychologists. These include openness, conscientiousness, extroversion, agreeableness, and emotional range. Identified language styles include confident, analytical, and tentative.

Intended Use
Common uses for the Tone Analyzer service include:
  • Personal and business communications - Anyone could use the Tone Analyzer service to get feedback about their communications, which could improve the effectiveness of the messages and how they are received.

  • Message resonance - optimize the tones in your communication to increase the impact on your audience

  • Digital Virtual Agent for customer care - If a human client is interacting with an automated digital agent, and the client is agitated or angry, it is likely reflected in the choice of words they use to explain their problem. An automated agent could use the Tone Analyzer Service to detect those tones, and be programmed to respond appropriately to them.

  • Self-branding - Bloggers and journalists could use the Tone Analyzer Service to get feedback on their tone and fine-tune their writing to reflect a specific personality or style.

You input

Any Text

Service output

JSON that provides a hierarchical representation of the analysis of the terms in the input message

More detailed documentation for Tone Analyzer
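As a rough sketch, a tone analysis request could be made with the Python requests library as below; the credentials are placeholders and the version date is an assumption, so check the current Tone Analyzer API reference for the exact parameters and response shape:

import requests

# Placeholder credentials for an assumed Tone Analyzer service instance
USERNAME = "your-tone-analyzer-username"
PASSWORD = "your-tone-analyzer-password"

url = "https://gateway.watsonplatform.net/tone-analyzer/api/v3/tone"

resp = requests.post(url,
                     auth=(USERNAME, PASSWORD),
                     params={"version": "2016-05-19"},   # assumed version date - check the docs
                     json={"text": "I am very frustrated that my order has still not arrived."})

print(resp.json())   # JSON describing the emotion, social and language-style tones detected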

Dialog

Deprecated as of August 15, 2016, but still in the test.

The IBM Watson Dialog service enables a developer to automate branching conversations between a user and your application. The Dialog service enables your applications to use natural language to automatically respond to user questions, cross-sell and up-sell, walk users through processes or applications, or even hand-hold users through difficult tasks. The Dialog service can track and store user profile information to learn more about end users, guide them through processes based on their unique situation, or pass their information to a back-end system to help them take action and get the help they need.

Intended Use

Walk a user through the steps to reset their password or help them choose a credit card. Developers script conversations as they would happen in the real world, upload them to the Dialog application and allow end users to get the help they need. This service should be used to enable back and forth conversations with a user. For example, clarifying questions, information gathering, walkthroughs and helping a user take an action.

You input

Script conversations based on your expert knowledge of the domain.

Service output

End users can chat with your application using natural language and get the pre-written responses you created.

More detailed documentation for Watson Dialog

Tradeoff Analytics

Tradeoff Analytics is a Watson service that helps people make decisions when balancing multiple objectives. The service uses a mathematical filtering technique called "Pareto Optimization", which enables users to explore tradeoffs when considering multiple criteria for a single decision. When your company makes decisions, how many factors need to be considered? What's the process like? How do you know when you've found the best option? With Tradeoff Analytics, users can avoid lists of endless options and identify the right option by considering multiple objectives.

Intended Use

Tradeoff Analytics can help bank analysts or wealth managers select the best investment strategy based on performance attributes, risk, and cost. It can help consumers purchase the product that best matches their preferences based on attributes like features, price, or warranties. Additionally, Tradeoff Analytics can help physicians find the most suitable treatment based on multiple criteria such as success rate, effectiveness, or adverse effects.

You input

A decision problem with objectives and options (for example, what is the best car when my goals are type, price, and fuel economy?)

Service output

JSON objects that represent the optimal options and highlight the trade-offs between them. The service recommends using a provided client-side library to consume its JSON output.

More detailed documentation for Tradeoff Analytics
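
As an illustration of the input format, a minimal "dilemma" submitted with Python's requests library might look like the sketch below. The endpoint URL and credentials are placeholders, and the column/option structure should be checked against the service's API reference.

```python
import requests

# Minimal sketch of a Tradeoff Analytics "dilemma".  URL and credentials are
# illustrative; the problem is expressed as objective columns plus options.
URL = "https://gateway.watsonplatform.net/tradeoff-analytics/api/v1/dilemmas"
AUTH = ("your-service-username", "your-service-password")

problem = {
    "subject": "phones",
    "columns": [
        {"key": "price", "type": "numeric", "goal": "min", "is_objective": True},
        {"key": "weight", "type": "numeric", "goal": "min", "is_objective": True},
    ],
    "options": [
        {"key": "a", "name": "Phone A", "values": {"price": 199, "weight": 130}},
        {"key": "b", "name": "Phone B", "values": {"price": 249, "weight": 110}},
    ],
}

# The response identifies the Pareto-optimal ("front") options and the
# trade-offs between them.
resp = requests.post(URL, auth=AUTH, json=problem)
resp.raise_for_status()
print(resp.json())
```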

Watson Conversation combines a number of cognitive techniques to help you build and train a bot - defining intents and entities and crafting dialog to simulate conversation. The system can then be further refined with supplementary technologies to make the system more human-like or to give it a higher chance of returning the right answer. Watson Conversation allows you to deploy a range of bots via many channels, from simple, narrowly focused Bots to much more sophisticated, full-blown virtual agents across mobile devices, messaging platforms like Slack, or even through a physical robot.

Suggested uses
  • Add a chatbot to your website that automatically responds to customers’ most frequently asked questions.

  • Build Twitter, Slack, Facebook Messenger, and other messaging platform chatbots that interact instantly with channel users.

  • Allow customers to control your mobile app using natural language virtual agents.

You input

Your domain expertise in the form of intents, entities and crafted conversation

Service output

A trained model that enables natural conversations with end users

With the IBM Watson™ Conversation service, you can create an application that understands natural-language input and uses machine learning to respond to customers in a way that simulates a conversation between humans.

How to use the service

This diagram shows the overall architecture of a complete solution:

Watson Conversation

  • Users interact with your application through the user interface that you implement, for example a simple chat window, a mobile app, or even a robot with a voice interface.

  • The application sends the user input to the Conversation service.

  • The application connects to a workspace, which is a container for your dialog flow and training data.

  • The service interprets the user input, directs the flow of the conversation and gathers information that it needs.

  • You can connect additional Watson services to analyze user input, such as Tone Analyzer or Speech to Text.

  • The application can interact with your back-end systems based on the user’s intent and additional information. For example, answer questions, open tickets, update account information, or place orders. There is no limit to what you can do.

More detailed documentation for Watson Conversation
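
A single conversation turn might look roughly like the following sketch. The workspace ID, version date, and credentials are placeholders for your own service instance; the context object returned by the service is passed back on each turn to keep conversation state.

```python
import requests

# Minimal sketch of one conversation turn.  Workspace ID, credentials, and
# version date are illustrative placeholders.
URL = ("https://gateway.watsonplatform.net/conversation/api/v1/"
       "workspaces/your-workspace-id/message")
AUTH = ("your-service-username", "your-service-password")

context = {}  # returned by the service; send it back to preserve state

reply = requests.post(
    URL,
    auth=AUTH,
    params={"version": "2016-09-20"},
    json={"input": {"text": "I'd like to reset my password"}, "context": context},
)
reply.raise_for_status()
result = reply.json()
print(result["output"]["text"])   # list of response strings from the dialog
context = result["context"]       # carry forward for the next turn
```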

The Watson Language Translator service provides domain-specific translation using Statistical Machine Translation techniques that have been perfected in IBM's research labs over the past few decades. The service offers multiple domain-specific translation models, plus three levels of self-service customization for text with very specific language. (Note: The Watson Language Translation service has been rebranded as the Language Translator service; it provides the same capabilities, but with simpler pricing.)

Intended use

What can be done with Watson Language Translator? As an example, an English-speaking help desk representative can assist a Spanish-speaking customer through chat (using the conversational translation model). As another example, a West African news website can curate English news from across the globe and present it in French to its constituents (using the news translation model). Similarly, a patent attorney in the US can effectively discover prior art (to invalidate a competitor's patent claims in litigation) based on invention disclosures made in Korean with the Korean Patent Office. Another example: a bank can translate all of its product descriptions from English to Arabic using a custom model tailored to the bank's product names and terminology. All of these examples (and more) can benefit from the real-time, domain-specific translation abilities of the Language Translator service.

You input

Plain text in one of the supported input languages and domains.

Service output

Plain text in the target language selected.

More detailed documentation for Language Translator
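
A minimal translation request might look like the sketch below. The URL and credentials are placeholders, and the source/target language pair selects a default English-to-Spanish model; a custom or domain-specific model can be chosen instead via its model ID.

```python
import requests

# Minimal sketch of a translation request.  URL and credentials are
# illustrative placeholders for your own service instance.
URL = "https://gateway.watsonplatform.net/language-translator/api/v2/translate"
AUTH = ("your-service-username", "your-service-password")

resp = requests.post(
    URL,
    auth=AUTH,
    headers={"Accept": "application/json"},
    json={"text": "How can I help you today?", "source": "en", "target": "es"},
)
resp.raise_for_status()
print(resp.json()["translations"][0]["translation"])
```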

Personality Insights extracts and analyzes a spectrum of personality attributes to help discover actionable insights about people and entities, and in turn guides end users to highly personalized interactions. The service outputs personality characteristics that are divided into three dimensions: the Big 5, Values, and Needs. We recommend using Personality Insights with at least 1200 words of input text.

Intended Use

The Personality Insights service lends itself to an almost limitless number of potential applications. Businesses can use the detailed personality portraits of individual customers for finer-grained customer segmentation and better-quality lead generation. This data enables them to design marketing, provide product recommendations, and deliver customer care that is more personal and relevant. Personality Insights can also be used to help recruiters or university admissions match candidates to companies or universities. For more detailed information, see the "Use Cases" section of the Personality Insights documentation.

You input

JSON, plain text, or HTML (such as social media posts, emails, blogs, or other communications) written by one individual

Service output

A tree of cognitive and social characteristics in JSON or CSV format

Personality Insights basics

The Personality Insights service offers a set of core analytics for discovering actionable insights about people and entities. The following sections provide basic information about using the service.

The personality models

The Personality Insights service is based on the psychology of language in combination with data analytics algorithms. The service analyzes the content that you send and returns a personality profile for the author of the input. The service infers personality characteristics based on three models:

  • Big Five personality characteristics represent the most widely used model for generally describing how a person engages with the world. The model includes five primary dimensions:

  • Agreeableness is a person's tendency to be compassionate and cooperative toward others.

  • Conscientiousness is a person's tendency to act in an organized or thoughtful way.

  • Extraversion is a person's tendency to seek stimulation in the company of others.

  • Emotional Range, also referred to as Neuroticism or Natural Reactions, is the extent to which a person's emotions are sensitive to the person's environment.

  • Openness is the extent to which a person is open to experiencing a variety of activities.

  • Each of these top-level dimensions has six facets that further characterize an individual according to the dimension.

  • Needs describe which aspects of a product will resonate with a person. The model includes twelve characteristic needs: Excitement, Harmony, Curiosity, Ideal, Closeness, Self-expression, Liberty, Love, Practicality, Stability, Challenge, and Structure.

  • Values describe motivating factors that influence a person's decision making. The model includes five values: Self-transcendence / Helping others, Conservation / Tradition, Hedonism / Taking pleasure in life, Self-enhancement / Achieving success, and Open to change / Excitement.

More detailed documentation for Personality Insights
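
A profile request using plain-text input might look like the sketch below. The URL, version date, and credentials are placeholders, and the sample file name is invented for illustration; real input should be roughly 1200 words or more for reliable results.

```python
import requests

# Minimal sketch of a Personality Insights profile request.  URL, version
# date, credentials, and the input file name are illustrative placeholders.
URL = "https://gateway.watsonplatform.net/personality-insights/api/v3/profile"
AUTH = ("your-service-username", "your-service-password")

# Text written by a single individual (e.g. blog posts or emails).
text = open("sample_author_text.txt").read()

resp = requests.post(
    URL,
    auth=AUTH,
    params={"version": "2016-10-20"},
    headers={"Content-Type": "text/plain", "Accept": "application/json"},
    data=text.encode("utf-8"),
)
resp.raise_for_status()
profile = resp.json()
for trait in profile["personality"]:      # the Big Five dimensions
    print(trait["name"], trait["percentile"])
```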

This service helps users find the most relevant information for their query by using a combination of search and machine learning algorithms to detect "signals" in the data. Built on top of Apache Solr, developers load their data into the service, train a machine learning model based on known relevant results, then leverage this model to provide improved results to their end users based on their question or query.

The Retrieve and Rank service can be applied to a number of information retrieval scenarios: for example, an experienced technician going onsite who requires help troubleshooting a problem, a contact center agent who needs assistance in dealing with an incoming customer issue, or a project manager finding domain experts from a professional services organization to build out a project team.

You input

At training time: your documents, plus queries (questions) associated with those documents. At runtime: user questions and queries.

Service output

At training time: indexed documents for retrieval and a trained machine learning model (the ranker). At runtime: a list of relevant documents and metadata.

Overview of the Retrieve and Rank service

The IBM Watson™ Retrieve and Rank service combines two information retrieval components in a single service: the power of Apache Solr and a sophisticated machine learning capability. This combination provides users with more relevant results by automatically reranking them by using these machine learning algorithms.

How to use the service

The following image shows the process of creating and using the Retrieve and Rank service:

RR

For a step-by-step overview of using the Retrieve and Rank service, see the Tutorial page.

Technologies

The purpose of the Retrieve and Rank service is to help you find documents that are more relevant than those that you might get with standard information retrieval techniques.

  • Retrieve: Retrieve is based on Apache Solr. It supports nearly all of the default Solr APIs and improves error handling and resiliency. You can start your solution by first using only the Retrieve features, and then add the ranking component.

  • Rank: The rank component (ranker) creates a machine-learning model trained on your data. You call the ranker in your runtime queries to use this model to boost the relevancy of your results with queries that the model has not previously seen.

The service combines several proprietary machine learning techniques, which are known as learning-to-rank algorithms. During its training, the ranker chooses the best combination of algorithms from your training data.

Primary uses

The core users of the Retrieve and Rank service are customer-facing professionals, such as support staff, contact center agents, and field technicians. These users must find relevant results quickly from large numbers of documents:

  • Customer support: Find quick answers for customers from your growing set of answer documents

  • Field technicians: Resolve technical issues onsite

  • Professional services: Find the right people with the right skills for key engagements

Benefits

The Retrieve and Rank service can improve information retrieval as compared to standard results:

  • The ranker models take advantage of rich data in your documents to provide more relevant answers to queries.

  • You benefit from new features developed both by the open source community and from advanced information retrieval techniques built by the Watson algorithm teams.

  • Each Solr cluster and ranker is highly available in the Bluemix environment. The scalable IBM infrastructure removes the need for you to staff your own highly available data center.

About Apache Solr

As previously mentioned, the Retrieve part of the Retrieve and Rank service is based on Apache Solr. When you use Retrieve and Rank, you need to be knowledgeable about Solr as well as about the specifics of the Retrieve and Rank service. For example, when Solr passes an error code to the service, the service passes it to your application without modification so that standard Solr clients can correctly parse and act upon it. You therefore need to know about Solr error codes when writing error-handling routines in your Retrieve and Rank application.

More detailed documentation for Retrieve and Rank
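
As a rough sketch of the runtime side, the snippet below first runs a plain Solr retrieval and then sends the same query through the ranked handler. The cluster ID, collection name, ranker ID, credentials, and the fcselect handler name are recalled from the service documentation and are placeholders to verify against it.

```python
import requests

# Rough sketch of Retrieve and Rank runtime queries.  Cluster ID, collection,
# ranker ID, and credentials are illustrative placeholders created at setup.
BASE = "https://gateway.watsonplatform.net/retrieve-and-rank/api/v1"
AUTH = ("your-service-username", "your-service-password")
CLUSTER_ID = "your-solr-cluster-id"
COLLECTION = "your-collection-name"
RANKER_ID = "your-trained-ranker-id"

# Standard Solr retrieval (no re-ranking):
retrieve = requests.get(
    "{}/solr_clusters/{}/solr/{}/select".format(BASE, CLUSTER_ID, COLLECTION),
    auth=AUTH,
    params={"q": "how do I reset my password", "wt": "json", "fl": "id,title"},
)
retrieve.raise_for_status()

# Retrieve *and* rank: the fcselect handler applies the trained ranker to
# re-order the Solr results by predicted relevance.
ranked = requests.get(
    "{}/solr_clusters/{}/solr/{}/fcselect".format(BASE, CLUSTER_ID, COLLECTION),
    auth=AUTH,
    params={"ranker_id": RANKER_ID, "q": "how do I reset my password",
            "wt": "json", "fl": "id,title"},
)
ranked.raise_for_status()
print(ranked.json()["response"]["docs"])
```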

Watson Speech to Text can be used anywhere there is a need to bridge the gap between the spoken word and its written form. This easy-to-use service uses machine intelligence to combine information about grammar and language structure with knowledge of the composition of an audio signal to generate an accurate transcription. It uses IBM's speech recognition capabilities to convert speech in multiple languages into text. The transcription of incoming audio is continuously sent back to the client with minimal delay, and it is corrected as more speech is heard. Additionally, the service now includes the ability to detect one or more keywords in the audio stream. The service is accessed via a WebSocket connection or REST API.

Intended Use

The Speech to Text service can be used anywhere voice-interactivity is needed. The service is great for mobile experiences, transcribing media files, call center transcriptions, voice control of embedded systems, or converting sound to text to then make data searchable. Supported languages include US English, UK English, Japanese, Spanish, Brazilian Portuguese, Modern Standard Arabic, and Mandarin. The Speech to Text service now provides the ability to detect the presence of specific keywords or key phrases in the input stream.

You input

Streamed or recorded audio containing intelligible speech

Service output

Text transcriptions of the audio with recognized words

Continuous transmission

By default, the service stops transcription at the first end-of-speech (EOS) incident, which is denoted by a half-second of non-speech (typically silence) or when the stream terminates. Set the continuous parameter to true to instruct the service to transcribe the entire audio stream until the stream terminates. In this case, the results can include multiple transcript elements to indicate phrases separated by pauses. You can concatenate the transcript elements to assemble the complete transcription of the audio stream.

More detailed documentation for Speech to Text
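
A one-shot (non-streaming) transcription over the REST API might look like the sketch below. The URL, credentials, and audio file name are placeholders; setting continuous=true keeps the service transcribing past the first pause instead of stopping at the first end-of-speech.

```python
import requests

# Minimal sketch of a REST transcription request.  URL, credentials, and the
# WAV file name are illustrative placeholders.
URL = "https://stream.watsonplatform.net/speech-to-text/api/v1/recognize"
AUTH = ("your-service-username", "your-service-password")

with open("sample.wav", "rb") as audio:
    resp = requests.post(
        URL,
        auth=AUTH,
        headers={"Content-Type": "audio/wav"},
        params={"continuous": "true"},
        data=audio,
    )
resp.raise_for_status()

# Concatenate the transcript elements returned for each pause-separated phrase.
transcript = "".join(
    result["alternatives"][0]["transcript"]
    for result in resp.json()["results"]
)
print(transcript)
```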

2.4. Explain use cases for integrating external systems (such as Twitter, Weather API).

Cognitive applications frequently integrate external systems and data sources with the Watson services. For example, tweets pulled from the Twitter API can be run through Tone Analyzer or AlchemyLanguage sentiment analysis to gauge public reaction to a brand, and Weather Company data can be combined with user questions in a conversational agent (for example, a travel bot that tailors its suggestions to the forecast). When the communicating systems are connected devices and sensors exchanging data, this is commonly described as the Internet of Things.
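
A hypothetical integration sketch is shown below: text is pulled from an external REST feed and each item is analyzed with Tone Analyzer. The external feed URL is invented purely for illustration, and the Tone Analyzer values mirror the earlier placeholder example.

```python
import requests

# Hypothetical integration: pull text from an external system (a placeholder
# social/brand-mention feed) and analyze each item with a Watson service.
EXTERNAL_FEED_URL = "https://example.com/api/mentions?topic=my-brand"  # invented
TONE_URL = "https://gateway.watsonplatform.net/tone-analyzer/api/v3/tone"
AUTH = ("your-service-username", "your-service-password")

mentions = requests.get(EXTERNAL_FEED_URL).json()   # assume a list of {"text": ...}

for mention in mentions:
    tones = requests.post(
        TONE_URL,
        auth=AUTH,
        params={"version": "2016-05-19"},
        json={"text": mention["text"]},
    )
    tones.raise_for_status()
    print(mention["text"][:40], tones.json())
```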

Section 3 – Fundamentals of IBM Watson Developer Cloud

3.1. Distinguish cognitive services on WDC for which training is required or not.

Some IBM Watson services work out-of-the-box as they were pre-trained in a specific domain (domain-adapted). Other Watson services require training. For pre-trained services, it’s critical to know the adapted domains as they indicate the areas in which the service will perform best.

Pre-trained Watson services:

  • Watson Text to Speech

  • Watson Speech to Text

  • Language Translation (conversational, news, and patent domains)

  • AlchemyLanguage (open domain)

  • Watson Visual Insights

  • Tone Analyzer

  • Personality Insights (social media domain)

Services requiring training:

  • Natural Language Classifier

  • The Rank component of Retrieve and Rank

  • Visual Recognition (custom models)

3.2. Provide examples of text classification using the NLC.

Some examples are:

  • Sentiment analysis

  • Spam email detection

  • Customer message routing

  • Academic paper classification into technical fields of interest

  • Forum post classification to determine the correct posting category

  • Patient reports for escalation and routing based on symptoms

  • News article analysis

  • Investment opportunity ranking

  • Web page topic analysis

3.3. Explain the Watson SDKs available as part of the services on Watson Developer Cloud.

3.4. Explain the Watson REST APIs available as part of the services on Watson Developer Cloud.

  • Identify the Language services on WDC

  • AlchemyLanguage

  • Conversation

  • Document Conversion

  • Language Translator

  • Natural Language Classifier

  • Natural Language Understanding

  • Personality Insights

  • Retrieve and Rank

  • Tone Analyzer

  • Identify the Vision services on WDC

  • Visual Recognition

  • Identify the Speech services on WDC

  • Speech to Text

  • Text to Speech

  • Identify the Data Insights services on WDC

  • AlchemyData News

  • Discovery

  • Tradeoff Analytics

3.5. Explain and configure Natural Language Classification.

The service enables developers without a background in machine learning or statistical algorithms to interpret the intent behind text. To configure it (see the sketch after this list):

  • Gather sample text from real end users (fake initially if you have to…but not much)

  • Determine the user intents that capture the actions/needs expressed in the text

  • Classify your user text into these user intents

  • Separate your user text into train/test datasets

  • Train an NLC classifier on your training dataset

  • Pass the user input to the NLC classifier

  • Determine the accuracy, precision, and recall of the NLC classifier using your test dataset

  • Improve the classifier iteratively by correcting and expanding the training data and retraining
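
A minimal sketch of the train/classify steps above is shown here. The URL, credentials, and the training CSV file name are illustrative; training.csv is assumed to hold "text,intent" rows.

```python
import json
import requests

# Minimal sketch of NLC training and classification.  URL, credentials, and
# the CSV file name are illustrative placeholders.
BASE = "https://gateway.watsonplatform.net/natural-language-classifier/api/v1"
AUTH = ("your-service-username", "your-service-password")

# 1. Train a classifier from labelled examples (returns a classifier_id;
#    training runs asynchronously, so poll its status before classifying).
with open("training.csv", "rb") as csv_file:
    train = requests.post(
        "{}/classifiers".format(BASE),
        auth=AUTH,
        files={
            "training_data": csv_file,
            "training_metadata": (None, json.dumps(
                {"language": "en", "name": "my-intents"})),
        },
    )
train.raise_for_status()
classifier_id = train.json()["classifier_id"]

# 2. Once training completes, classify new user text against the model.
classify = requests.post(
    "{}/classifiers/{}/classify".format(BASE, classifier_id),
    auth=AUTH,
    json={"text": "I forgot my password"},
)
classify.raise_for_status()
print(classify.json()["top_class"], classify.json()["classes"][:3])
```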

3.6. Explain and configure Visual recognition.

  • Describe the process for training a classifier

  • Explain how to identify images with a specified classifier

  • Describe the capabilities of Face Detection/Recognition

  • Describe the capabilities of Natural Scene OCR

3.7. Explain how Personality Insights service works.

3.8. Explain how Tone Analyzer service works.

3.9. Explain and execute Alchemy Language services.

3.10. Explain and configure Retrieve and Rank service.

  • Explain the function of the Retrieve and Rank service

  • Configure the Retrieve and Rank service

  • Create Solr cluster

  • Create and upload Solr configuration

  • Create Solr collection

  • Upload and index documents

  • Create / update ground truth

  • Create and train Ranker

  • Evaluate result / update ground truth

Section 4 - Developing Cognitive applications using Watson Developer Cloud Services

4.1. Call a Watson API to analyze content.

  • Alchemy Language

  • Create an instance of the Alchemy Language service in Bluemix

  • Select the correct API to call for text extraction, sentiment analysis, or any of the Alchemy Language services.

  • Pass your content to your AlchemyLanguage service's endpoint through a RESTful API call (a minimal sketch follows at the end of this section)

  • Natural Language Classifier

  • Gather sample text from real end users (fake initially if you have to…but not much)

  • Determine the user intents that capture the actions/needs expressed in the text

  • Classify your user text into these user intents

  • Separate your user text into train/test datasets

  • Create an instance of the Natural Language Classifier service in Bluemix

  • Train an NLC classifier on your training dataset

  • Pass your content to your NLC service's endpoint through a RESTful API call

  • Determine the accuracy, precision, and recall of the NLC classifier using your test dataset

  • Personality Insights

  • Create an instance of the Personality Insights service in Bluemix

  • Gather text from users in their own voice

  • Ensure you meet the minimum limits for word count (currently 5,000 words) to limit sampling error.

  • Pass your content to your Personality Insights service's endpoint through a RESTful API call
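
As referenced above, a minimal AlchemyLanguage call (sentiment on raw text) might look like the sketch below. The API key is a placeholder; AlchemyLanguage authenticates with an apikey rather than the username/password pair used by most other Watson services, and the endpoint should be verified against the AlchemyLanguage API reference.

```python
import requests

# Rough sketch of one AlchemyLanguage call: sentiment analysis on raw text.
# The API key is an illustrative placeholder.
URL = "https://gateway-a.watsonplatform.net/calls/text/TextGetTextSentiment"
API_KEY = "your-alchemy-api-key"

resp = requests.post(URL, data={
    "apikey": API_KEY,
    "text": "The new release is fantastic and support has been great.",
    "outputMode": "json",
})
resp.raise_for_status()
print(resp.json()["docSentiment"])   # e.g. {"type": "positive", "score": "0.61"}
```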

4.2. Describe the tasks required to implement the Conversational Agent / Digital Bot.

  • Document the primary conversation flow for your users.

  • Determine ways users could diverge from this flow and how to redirect them back.

  • Determine the primary user intents plus paraphrases at various nodes in your conversation

  • Define profile variables

  • Create instances of Dialog and NLC services on Bluemix

  • Upload these intents plus paraphrases into the NLC Service

  • Build out conversation flow with Dialog Service

  • Present your beta conversation agent to end users to capture real end user text

  • Identify areas where users strayed outside the domain of your conversation agent

  • Identify areas where your conversation agent misunderstood the user

  • Update your conversation agent with new intents plus real end user text

4.3. Transform service outputs for consumption by other services.

  • Natural Language Classifier

  • Use the classes returned by NLC to drive dialog selections in Dialog

  • Personality Insights

  • Use the service output from two different textual inputs and compare the personalities based on the results

  • Speech to text

  • Use the transcribed output from Speech to Text as input to Language Translation (see the chaining sketch after this list)

  • Language translation

  • Use the translated text from language translation as input to text to speech

  • AlchemyData News

  • Use the top article returned by an AlchemyData News search as input to AlchemyLanguage sentiment analysis and Tone Analyzer

  • Use the top article returned by an AlchemyData News search as input to relationship extraction to tell who is trending in the article
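
As referenced above, the Speech to Text → Language Translator → Text to Speech chain could look roughly like the following. All URLs, credentials, file names, and the Spanish voice name are placeholders; in practice each service instance has its own credentials.

```python
import requests

# Rough chaining sketch: transcribe English audio, translate it to Spanish,
# then synthesize the translation as speech.  All values are illustrative;
# each service instance normally has its own credentials.
AUTH = ("your-service-username", "your-service-password")
STT_URL = "https://stream.watsonplatform.net/speech-to-text/api/v1/recognize"
LT_URL = "https://gateway.watsonplatform.net/language-translator/api/v2/translate"
TTS_URL = "https://stream.watsonplatform.net/text-to-speech/api/v1/synthesize"

# 1. Speech to Text
with open("question.wav", "rb") as audio:
    stt = requests.post(STT_URL, auth=AUTH,
                        headers={"Content-Type": "audio/wav"}, data=audio)
stt.raise_for_status()
english = stt.json()["results"][0]["alternatives"][0]["transcript"]

# 2. Language Translator (English -> Spanish)
lt = requests.post(LT_URL, auth=AUTH, headers={"Accept": "application/json"},
                   json={"text": english, "source": "en", "target": "es"})
lt.raise_for_status()
spanish = lt.json()["translations"][0]["translation"]

# 3. Text to Speech (the Spanish voice name is an assumption; check the
#    service's voice list for the one you want)
tts = requests.post(TTS_URL, auth=AUTH,
                    params={"voice": "es-US_SofiaVoice", "accept": "audio/wav"},
                    json={"text": spanish})
tts.raise_for_status()
with open("answer_es.wav", "wb") as out:
    out.write(tts.content)
```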

4.4. Define common design patterns for composing multiple Watson services together (across APIs).

Cognitive systems tend to gain more value as additional services are composed. With so many services, it is sometimes hard to tell which services work best together.

  • Conversation - Goal: Engage the user in back-and-forth dialog while detecting and acting on user intent. The specifics of the actions taken are guided by the entities discovered. Services: Watson Dialog + Natural Language Classifier + entity extraction (AlchemyLanguage)

  • Q&A - Goal: Answer a wide range of customer questions while offering precise answers for frequently asked facts and highly relevant passages for less frequent questions that may not have a single best answer. Services: Watson Dialog + Natural Language Classifier + Retrieve and Rank

  • Agent Assist - Goal: Provide natural language help systems so call agents can rapidly retrieve answers to customer questions. Services: Watson Dialog + Natural Language Classifier + entity extraction (AlchemyLanguage)

  • Automated Customer Support Routing - Goal: Detect the topic of a ticket and route it to the appropriate department, e.g. room service, maintenance, or housekeeping in the case of hotel guest request routing. Services: Keyword extraction and sentiment analysis (AlchemyLanguage)

  • Data Insights - Goal: Monitor all posts with specific keywords (e.g. for a company's followers, sponsors, or critics) to detect what is being discussed and the sentiment/tone associated with it. Services: Keyword extraction, entity extraction, and sentiment/tone analysis (AlchemyLanguage)

4.5. Design and execute a use case driven service choreography (within an API).

  • Natural Language Classifier

  • Create a classifier

  • Return label information

  • List classifiers

  • Dialog

  • Upload a dialog

  • Retrieve content

  • Update content for specific nodes

  • Start new conversation

  • Set profile variables

  • Language Translation

  • Upload glossary

  • Return translation status

  • Translate input

  • Alchemy Vision

  • Recognize face

  • Extract link

  • Tag image

  • Detect text

4.6. Deploy a web application to IBM Bluemix.

  • Configure application’s manifest to request the correct memory and app instance allocations

  • Configure the application with service credentials extracted from VCAP_SERVICES (see the sketch after this list)

  • Create instances of your required services in IBM Bluemix

  • Install Cloud Foundry command line tools

  • Log-in to IBM Bluemix from the command line

  • Push the application to IBM Bluemix using the Cloud Foundry command line tools
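
As a sketch of the credentials step above (assuming a Python application with a bound Conversation instance), reading VCAP_SERVICES might look like this; the "conversation" label is an example and should match the label of whichever service you bound.

```python
import json
import os

# Minimal sketch of reading bound-service credentials from the VCAP_SERVICES
# environment variable that Bluemix injects into a pushed application.
vcap = json.loads(os.environ.get("VCAP_SERVICES", "{}"))

credentials = {}
if "conversation" in vcap:
    credentials = vcap["conversation"][0]["credentials"]

username = credentials.get("username")
password = credentials.get("password")
url = credentials.get("url")
```

The application itself is then deployed with the Cloud Foundry CLI: log in with `cf login` and run `cf push` from the directory containing the application's manifest, which declares the requested memory and instance count.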

Section 5 - Administration & DevOps for applications using IBM Watson Developer Cloud Services

5.1. Describe the process of obtaining credentials for Watson services.

5.2. Monitor resource utilization of applications using IBM Watson services.

5.3. Monitoring application performance on IBM Bluemix.

5.4. Examine application logs provided on IBM Bluemix.