Skip to main content

Practical crowdsourcing (w/ stopping criterion)

Practical crowdsourcing should monitor the confidence of the label and set the stopping criterion.
Created on July 22|Last edited on July 27

Desiderata: Higher accuracy & higher convergence rate




  • kcenter doesn't quite help still
  • k-NN performance drops a lot when using dimension reduced features

  • k-NN has a long tail -> not calibrated?

p(yi∣Z,xi)∝p(yi∣xi,Z/i)p(yi∣zi)p(y_i|Z, x_i) \propto p(y_i|x_i, Z_{/i}) p(y_i|z_i)

  • p(yi∣xi,Z/i)=∏j∼Nip(yj∣zj)s(i,j)p(y_i|x_i, Z_{/i}) = \prod_{\mathcal{j \sim N_i}} p(y_j|z_j)^{s(i, j)}
  • p(yi∣xi,Z/i)=∏j∼Nip(yj∣zj)p(eij∣xi,xj)=∏j∼Nip(yj∣zj)s(i,j)p(y_i|x_i, Z_{/i}) = \prod_{\mathcal{j \sim N_i}} p(y_j|z_j) p(e_{ij}| x_i, x_j) = \prod_{\mathcal{j \sim N_i}} p(y_j|z_j) s(i, j)
20406080100Number of annotation / 10000.20.40.60.81Accuracy
{'budget': 10000, 'batch_size': 100, 'sample': 'random'} bayes+knn/acc
{'budget': 10000, 'batch_size': 100, 'sample': 'random'} bayes+cv_prior/acc
{'budget': 10000, 'batch_size': 100, 'sample': 'random'} bayes/acc
Random Sample
13
Kcenter
13



Random Sample
14
Kcenter
14