mAP: refresh

Assume that we want to estimate an unobserved population parameter $\theta$ on the basis of observations $x$. Let $f$ be the sampling distribution of $x$, so that $f(x \mid \theta)$ is the probability of $x$ when the underlying population parameter is $\theta$. Then the function:

$$\theta \mapsto f(x \mid \theta)$$

is known as the likelihood function and the estimate:

$$\hat{\theta}_{\mathrm{ML}}(x) = \underset{\theta}{\operatorname{arg\,max}}\ f(x \mid \theta)$$

is the maximum likelihood estimate of $\theta$.
Now assume that a prior distribution $g$ over $\theta$ exists. This allows us to treat $\theta$ as a random variable, as in Bayesian statistics. We can calculate the posterior distribution of $\theta$ using Bayes' theorem:

$$f(\theta \mid x) = \frac{f(x \mid \theta)\, g(\theta)}{\int_{\Theta} f(x \mid \vartheta)\, g(\vartheta)\, d\vartheta}$$

where $g$ is the density function of $\theta$ and $\Theta$ is the domain of $g$.

The method of maximum a posteriori estimation then estimates $\theta$ as the mode of the posterior distribution of this random variable:

$$\hat{\theta}_{\mathrm{MAP}}(x) = \underset{\theta}{\operatorname{arg\,max}}\ \frac{f(x \mid \theta)\, g(\theta)}{\int_{\Theta} f(x \mid \vartheta)\, g(\vartheta)\, d\vartheta} = \underset{\theta}{\operatorname{arg\,max}}\ f(x \mid \theta)\, g(\theta)$$
The denominator of the posterior distribution (the so-called marginal likelihood) is always positive and does not depend on $\theta$, so it plays no role in the optimization. Observe that the MAP estimate of $\theta$ coincides with the ML estimate when the prior $g$ is uniform (that is, a constant function).
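As an illustration, here is a minimal sketch of ML versus MAP estimation for a Bernoulli parameter with a Beta prior (the example and function names are my own, not from the text above): the MLE is $h/n$, and for a $\mathrm{Beta}(a, b)$ prior with $a, b \ge 1$ the MAP estimate is the posterior mode $(h + a - 1)/(n + a + b - 2)$. With the uniform prior $\mathrm{Beta}(1, 1)$ the two coincide, as noted above.

```python
# Sketch: ML vs MAP estimation of a Bernoulli parameter theta,
# given h successes in n trials and a Beta(a, b) prior (a, b >= 1).

def mle_bernoulli(h: int, n: int) -> float:
    """Maximum likelihood estimate: argmax_theta f(x | theta) = h / n."""
    return h / n

def map_bernoulli(h: int, n: int, a: float, b: float) -> float:
    """MAP estimate: mode of the Beta(h + a, n - h + b) posterior."""
    return (h + a - 1) / (n + a + b - 2)

h, n = 7, 10
print(mle_bernoulli(h, n))        # 0.7
print(map_bernoulli(h, n, 1, 1))  # 0.7    -- uniform prior: MAP == MLE
print(map_bernoulli(h, n, 2, 2))  # 0.6667 -- prior pulls the estimate toward 0.5
```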
When the loss function is of the form

$$L(\theta, a) = \begin{cases} 0, & \text{if } |a - \theta| < c, \\ 1, & \text{otherwise,} \end{cases}$$

the Bayes estimator approaches the MAP estimator as $c \to 0$, provided that the distribution of $\theta$ is quasi-concave.

Suppose you are answering a series of questions and receive one point for each correct answer, zero otherwise. If the returned results are

1, 0, 0, 1, 1, 1

(1 = relevant, 0 = not), the precision at each relevant rank is 1/1, 2/4, 3/5, 4/6, while the non-relevant ranks contribute zero, giving the sequence 1/1, 0, 0, 2/4, 3/5, 4/6.

The AP for the above example is (1/1 + 2/4 + 3/5 + 4/6) / 4 ≈ 0.6917.
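A quick sketch (using the indicator-sum form of AP defined in the next section) reproduces this number:

```python
# Sketch: AP for the example ranking above (1 = relevant, 0 = not).
results = [1, 0, 0, 1, 1, 1]

hits, total = 0, 0.0
for k, rel in enumerate(results, start=1):
    if rel:
        hits += 1
        total += hits / k  # precision at each relevant rank
print(total / hits)        # (1/1 + 2/4 + 3/5 + 4/6) / 4 ≈ 0.6917
```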

Average precision

Precision and recall are single-value metrics based on the whole list of documents returned by the system. For systems that return a ranked sequence of documents, it is desirable to also consider the order in which the returned documents are presented. By computing precision and recall at every position in the ranked sequence of documents, one can plot a precision-recall curve, plotting precision $p$ as a function of recall $r$. Average precision computes the average value of $p(r)$ over the interval from $r = 0$ to $r = 1$:[9]

$$\operatorname{AveP} = \int_0^1 p(r)\, dr$$
That is the area under the precision-recall curve. In practice, this integral is replaced with a finite sum over every position in the ranked sequence of documents:

$$\operatorname{AveP} = \sum_{k=1}^{n} P(k)\, \Delta r(k)$$

where $k$ is the rank in the sequence of retrieved documents, $n$ is the number of retrieved documents, $P(k)$ is the precision at cut-off $k$ in the list, and $\Delta r(k)$ is the change in recall from items $k-1$ to $k$.[9]
This finite sum is equivalent to:

$$\operatorname{AveP} = \frac{\sum_{k=1}^{n} P(k) \times \operatorname{rel}(k)}{\text{number of relevant documents}}$$

where $\operatorname{rel}(k)$ is an indicator function equaling 1 if the item at rank $k$ is a relevant document, zero otherwise.[10] Note that the average is over all relevant documents, so relevant documents that are not retrieved receive a precision score of zero.
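A minimal sketch of this formula (function and parameter names are my own; `n_relevant` counts all relevant documents for the query, including any that were never retrieved):

```python
# Sketch: AP via the indicator-sum form above.
def average_precision(ranking: list[int], n_relevant: int) -> float:
    """ranking[k-1] is 1 if the document at rank k is relevant, else 0.
    n_relevant is the total number of relevant documents for the query,
    so relevant documents that were never retrieved score zero."""
    hits, total = 0, 0.0
    for k, rel in enumerate(ranking, start=1):
        if rel:
            hits += 1
            total += hits / k  # P(k) * rel(k); here Delta r(k) = 1 / n_relevant
    return total / n_relevant

print(average_precision([1, 0, 0, 1, 1, 1], n_relevant=4))  # 0.6917
```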
Some authors choose to interpolate the $p(r)$ function to reduce the impact of "wiggles" in the curve.[11][12] For example, the PASCAL Visual Object Classes challenge (a benchmark for computer vision object detection) computed average precision prior to 2010 by averaging the precision over a set of eleven evenly spaced recall levels $\{0, 0.1, 0.2, \ldots, 1.0\}$; after 2010 the evaluation metric changed to effectively sample the curve at all unique recall values:[11][12]

$$\operatorname{AveP} = \frac{1}{11} \sum_{r \in \{0,\, 0.1,\, \ldots,\, 1.0\}} p_{\mathrm{interp}}(r)$$

where $p_{\mathrm{interp}}(r)$ is an interpolated precision that takes the maximum precision over all recalls greater than or equal to $r$:

$$p_{\mathrm{interp}}(r) = \max_{\tilde{r} \ge r} p(\tilde{r}).$$
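A sketch of this 11-point interpolated variant, reusing the hypothetical `ranking`/`n_relevant` convention from the function above:

```python
# Sketch: 11-point interpolated AP (PASCAL VOC pre-2010 style).
def interpolated_average_precision(ranking: list[int], n_relevant: int) -> float:
    # Build the (recall, precision) points of the raw curve.
    hits, points = 0, []
    for k, rel in enumerate(ranking, start=1):
        if rel:
            hits += 1
        points.append((hits / n_relevant, hits / k))
    # p_interp(r) = max precision over all recalls >= r (0 if none remain).
    levels = [i / 10 for i in range(11)]
    p_interp = [max((p for r, p in points if r >= level), default=0.0)
                for level in levels]
    return sum(p_interp) / 11

print(interpolated_average_precision([1, 0, 0, 1, 1, 1], n_relevant=4))  # ~0.7576
```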
An alternative is to derive an analytical $p(r)$ function by assuming a particular parametric distribution for the underlying decision values. For example, a binormal precision-recall curve can be obtained by assuming decision values in both classes to follow a Gaussian distribution.[13]
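A rough sketch of the binormal idea (all distribution parameters below are illustrative assumptions, not values from the cited work): assume the decision values of relevant and non-relevant documents each follow a Gaussian, then trace precision and recall as the decision threshold sweeps.

```python
# Sketch: binormal precision-recall curve from assumed Gaussian decision values.
from statistics import NormalDist

pos = NormalDist(mu=1.0, sigma=1.0)  # decision values of relevant docs (assumed)
neg = NormalDist(mu=0.0, sigma=1.0)  # decision values of non-relevant docs (assumed)
pi_pos = 0.3                         # fraction of relevant documents (assumed)

def pr_at_threshold(t: float) -> tuple[float, float]:
    recall = 1.0 - pos.cdf(t)  # P(score > t | relevant)
    fpr = 1.0 - neg.cdf(t)     # P(score > t | non-relevant)
    precision = pi_pos * recall / (pi_pos * recall + (1 - pi_pos) * fpr)
    return recall, precision

for t in [2.0, 1.0, 0.0, -1.0, -2.0]:
    r, p = pr_at_threshold(t)
    print(f"t={t:+.1f}  recall={r:.3f}  precision={p:.3f}")
```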

Mean average precision

Mean average precision for a set of queries is the mean of the average precision scores for each query:

$$\operatorname{MAP} = \frac{\sum_{q=1}^{Q} \operatorname{AveP}(q)}{Q}$$

where $Q$ is the number of queries.
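Reusing the hypothetical `average_precision` sketch from above, mean average precision is a one-line reduction:

```python
# Sketch: mean AP over a set of queries, using average_precision() from above.
queries = [
    ([1, 0, 0, 1, 1, 1], 4),  # (ranking, total relevant docs) per query
    ([0, 1, 1, 0, 0, 0], 2),
]
mean_ap = sum(average_precision(r, n) for r, n in queries) / len(queries)
print(mean_ap)
```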
