Conversational search analytics

Plus Enterprise IBM Cloud Pak for Data IBM Software Hub

Overview

You can analyze the performance of your conversational search by using a holistic routing graph of your assistant. In the watsonx Assistant home page, go to Analyze > Conversational search to open the conversational search statistics as a preview.

Analyzing data and Conversational search scores

You can see the average scores for citations per response, answer length, answer confidence, and extractiveness in the draft configuration. You can filter conversational search responses that use successful conversational search responses or “I don’t know”. Click any Customer input to view the inline citations for that conversational search.

Hover on the information icon () next to the customer input to see the query text inferred from the context.

For a single utterance-response pair, you can view the following metrics:

Response confidence score

The response confidence score is the estimated probability that the assistant’s response is correct, relevant, and useful in addressing the user's query or request for the available content.

Retrieval confidence score

The retrieval confidence score measures how certain the system is that it retrieved the most relevant information from its database to answer a user's query. It is the estimated probability that the retrieved data contains the necessary details to respond accurately to the user's request.

Extractiveness

Extractiveness is the extent to which the response is directly derived from the input. It is the fraction of the response that consists of sequences of words that are in the search results. A high score indicates that much of the response is directly quoted from the sources. A low score indicates that the response is abstracted or paraphrased from the sources. However, it can also mean that the response is not supported by the sources.

Citations

Citations refer to the acknowledgment of the sources of data, models, or algorithms that the system uses to generate its outputs or make its predictions. On the analytics page, you can see the number of citations that are associated with the response.

Response length

The number of characters in the response.

Average citations per response

The average number of citations that are received by each response that is provided by the assistant.

Average response length

The average length of characters required to provide a helpful response.

For all the metrics, the average is the average among all questions for which we generated a response, regardless of whether we provided it or not.