How is DCG calculated?

Using a graded relevance scale of documents in a search-engine result set, DCG measures the usefulness, or gain, of a document based on its position in the result list. The gain is accumulated from the top of the result list to the bottom, with the gain of each result discounted at lower ranks.
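One common way to write this definition is sketched below. It assumes $rel_i$ denotes the graded relevance of the result at position $i$ and a base-2 logarithmic discount; the paragraph above does not fix a particular discount function, so both are assumptions.

```latex
\mathrm{DCG}_p = \sum_{i=1}^{p} \frac{rel_i}{\log_2(i + 1)}
```

A frequently used variant replaces the numerator with $2^{rel_i} - 1$, which puts more weight on highly relevant results.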

What is DCG metric?

Discounted Cumulative Gain (DCG) is a metric for measuring ranking quality. It is mostly used in information retrieval, for example to measure the effectiveness of a search engine algorithm by how well it orders the articles it displays by their relevance to the search keyword.

What is ideal DCG?

Normalized DCG (nDCG) divides DCG by the best score achievable. Sorting all relevant documents in the corpus by their relative relevance produces the maximum possible DCG through position $p$, also called Ideal DCG (IDCG) through that position. Normalized DCG is then given by $\mathrm{nDCG}_p = \mathrm{DCG}_p / \mathrm{IDCG}_p$.

What is DCG in information retrieval?

NDCG is a measure of ranking quality. In information retrieval, such measures are used to assess document retrieval algorithms. The same measures are also used to evaluate recommendation engines.

Why do we use NDCG?

Normalized Discounted Cumulative Gain (NDCG) is a popular method for measuring the quality of a set of search results. It asserts that very relevant results are more useful than somewhat relevant results, which are in turn more useful than irrelevant results (cumulative gain).

How do you calculate NDCG?

To compute Normalized Discounted Cumulative Gain, sum the true scores ranked in the order induced by the predicted scores, after applying a logarithmic discount. Then divide by the best possible score (Ideal DCG, obtained for a perfect ranking) to obtain a score between 0 and 1.
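The two steps above can be sketched in plain Python. This is a minimal illustration, assuming a log2(rank + 1) discount and non-negative relevance labels; the function names and the sample data are hypothetical and not taken from any particular library.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain with a log2(rank + 1) discount (assumed)."""
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances, start=1))


def ndcg_from_scores(y_true, y_score):
    """NDCG of the ranking induced by sorting items by predicted score."""
    # Put the true relevance values in order of descending predicted score.
    order = sorted(range(len(y_score)), key=lambda i: y_score[i], reverse=True)
    ranked_true = [y_true[i] for i in order]
    # Ideal DCG: the true relevance values sorted in descending order.
    ideal = dcg(sorted(y_true, reverse=True))
    # If nothing is relevant, return 0.0 by convention (an assumption).
    return dcg(ranked_true) / ideal if ideal > 0 else 0.0


y_true = [3, 2, 0, 1]            # hypothetical graded relevance labels
y_score = [0.9, 0.2, 0.4, 0.7]   # hypothetical model scores
print(round(ndcg_from_scores(y_true, y_score), 4))
```

When the predicted scores sort the items in exactly the same order as the true labels, the function returns 1.0.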

How NDCG is calculated?

nDCG calculates the Cumulative Gain of a set of results by summing the relevance of each item in the result set. Each item's contribution is then discounted by its position: the lower a relevant item appears in the list, the larger the discount applied to what it contributes to the total score.

How is NDCG calculated?

In words, we first sort the list of candidate answers in descending order of their relevance score. Then we compute a discounted score for each item by dividing its relevance score by the log (base 2) of its rank plus 1. The plus 1 avoids division by 0, since the log of 1 is 0 at rank 1.
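The per-item step can be sketched as follows. The relevance grades are hypothetical, and ranks are assumed to start at 1.

```python
import math


def discounted_gains(relevances):
    """Return rel / log2(rank + 1) for each item, with ranks starting at 1."""
    return [rel / math.log2(rank + 1)
            for rank, rel in enumerate(relevances, start=1)]


ranked = [3, 2, 3, 0, 1, 2]  # hypothetical relevance grades, in ranked order
gains = discounted_gains(ranked)
for rank, (rel, gain) in enumerate(zip(ranked, gains), start=1):
    print(f"rank {rank}: relevance {rel}, discounted gain {gain:.3f}")
print(f"DCG = {sum(gains):.3f}")
```

The item at rank 1 keeps its full relevance because log2(2) = 1, while the grade-3 item at rank 3 contributes only 3 / log2(4) = 1.5.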

What is a good NDCG?

An NDCG of 0.8 means the ranking scores 80% of what the best possible ranking would. This is an intuitive explanation; the real math includes some logarithms, but it is not far from this.

What is a good NDCG value?

This best ranking would have a DCG score of 3/1 + 2/2 + 2/3 + 0/4 = 4.67. Note that this example uses a simplified discount of 1/rank instead of a logarithm. This is known as the “ideal DCG,” or iDCG. Our NDCG is the score we got (3.5) divided by the ideal DCG (4.67), or 3.5 / 4.67 = 0.75.
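The arithmetic can be checked with a few lines of Python. The ideal ordering comes from the numbers above. The observed ordering is not given in the text, so the one used here is hypothetical, chosen only because it yields the stated score of 3.5 under the same 1/rank discount.

```python
def dcg_simple(relevances):
    """DCG with the simplified 1/rank discount used in this example."""
    return sum(rel / rank for rank, rel in enumerate(relevances, start=1))


ideal = [3, 2, 2, 0]     # best ranking, from the sum in the text
observed = [2, 0, 3, 2]  # hypothetical ordering that scores 3.5

idcg = dcg_simple(ideal)
score = dcg_simple(observed)
print(round(idcg, 2), score, round(score / idcg, 2))
```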

What does NDCG of 1 mean?

See en.wikipedia.org/wiki/Discounted_cumulative_gain. nDCG is there so that the values fall between 0 and 1 and have a “natural” interpretation. A score of 1 means that the hits in a search are perfectly ordered by relevance, while a score of 0 means that none of the ranked hits has any relevance. A score of 0.5 means the ranking achieves half of the ideal DCG, which is not the same as half of the hits being in the right place.
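A small sketch illustrates the endpoints. It assumes a log2(rank + 1) discount and normalizes by the ideal ordering of the same list; both choices are assumptions, and the relevance grades are hypothetical.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain with a log2(rank + 1) discount (assumed)."""
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances, start=1))


def ndcg(relevances):
    """DCG divided by the DCG of the same grades sorted in descending order."""
    ideal = dcg(sorted(relevances, reverse=True))
    # With no relevant items the ideal DCG is 0; return 0.0 by convention.
    return dcg(relevances) / ideal if ideal > 0 else 0.0


perfect = [3, 2, 1, 0]         # already sorted by relevance
reversed_order = [0, 1, 2, 3]  # worst ordering of the same grades
all_irrelevant = [0, 0, 0, 0]  # nothing relevant was returned

for name, ranking in [("perfect", perfect),
                      ("reversed", reversed_order),
                      ("all irrelevant", all_irrelevant)]:
    print(f"{name}: {ndcg(ranking):.3f}")
```

Under these assumptions the fully reversed list still scores well above 0, because every relevant item keeps a discounted share of its gain.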

How do you normalize DCG values?

To normalize DCG values, an ideal ordering for the given query is needed. For this example, that ordering would be the monotonically decreasing sort of all known relevance judgments. In addition to the six from this experiment, suppose we also know there is a document …

How is the gain accumulated in DCG?

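The gain is accumulated from the top of the result list to the bottom, with the gain of each result discounted at lower ranks. Two assumptions are made in using DCG and its related measures: highly relevant documents are more useful when they appear earlier in the result list, and highly relevant documents are more useful than marginally relevant documents, which are in turn more useful than non-relevant documents.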

How do you compare two DCG values?

Raw DCG values from different queries, or from result lists of different lengths, are not directly comparable, so they must be normalized first. To normalize DCG values, an ideal ordering for the given query is needed: the monotonically decreasing sort of all known relevance judgments for that query.
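The sketch below shows why raw DCG can mislead when comparing across queries. It assumes a log2(rank + 1) discount and an ideal ordering built from all known judgments, truncated to the length of the result list. The two queries and their judgments are hypothetical.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain with a log2(rank + 1) discount (assumed)."""
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances, start=1))


def ndcg(ranked, known_judgments):
    """Normalize by the ideal ordering of all known judgments for the query."""
    # The ideal list may include relevant documents that were not retrieved.
    ideal = sorted(known_judgments, reverse=True)[:len(ranked)]
    idcg = dcg(ideal)
    return dcg(ranked) / idcg if idcg > 0 else 0.0


# Query A: many highly relevant documents exist, and one was missed.
a_ranked, a_known = [3, 2, 3], [3, 3, 3, 2]
# Query B: few relevant documents exist, and all are ranked perfectly.
b_ranked, b_known = [1, 1, 0], [1, 1, 0]

print(f"A: DCG={dcg(a_ranked):.3f}  NDCG={ndcg(a_ranked, a_known):.3f}")
print(f"B: DCG={dcg(b_ranked):.3f}  NDCG={ndcg(b_ranked, b_known):.3f}")
```

Query A has the larger raw DCG simply because more relevance is available to collect, yet query B is the one ranked perfectly. Only the normalized values reflect that.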

How do you find the maximum DCG of a corpus?

This is done by sorting all relevant documents in the corpus by their relative relevance, producing the maximum possible DCG through position $p$, also called Ideal DCG (IDCG) through that position. Here $REL_p$ is the list of relevant documents in the corpus, ordered by relevance, up to position $p$, and $|REL_p|$ is its length.
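Written out, and assuming a base-2 logarithmic discount with the plain relevance grade as the gain (an assumption; other gain functions are also in use), the ideal and normalized scores could take the following form. Here $rel_i$ is the relevance of the $i$-th document in the relevance-ordered list of relevant documents up to position $p$, whose length is the upper limit of the sum.

```latex
\mathrm{IDCG}_p = \sum_{i=1}^{|REL_p|} \frac{rel_i}{\log_2(i + 1)},
\qquad
\mathrm{nDCG}_p = \frac{\mathrm{DCG}_p}{\mathrm{IDCG}_p}
```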