arXiv Analytics

Sign in

arXiv:0705.2765 [cs.LG]AbstractReferencesReviewsResources

On the monotonization of the training set

Rustem Takhanov

Published 2007-05-18Version 1

We consider the problem of minimal correction of the training set to make it consistent with monotonic constraints. This problem arises during analysis of data sets via techniques that require monotone data. We show that this problem is NP-hard in general and is equivalent to finding a maximal independent set in special orgraphs. Practically important cases of that problem considered in detail. These are the cases when a partial order given on the replies set is a total order or has a dimension 2. We show that the second case can be reduced to maximization of a quadratic convex function on a convex set. For this case we construct an approximate polynomial algorithm based on convex optimization.

Related articles: Most relevant | Search more
arXiv:cs/0212035 [cs.LG] (Published 2002-12-12)
Exploiting Context When Learning to Classify
arXiv:2210.06005 [cs.LG] (Published 2022-10-12)
Generative Adversarial Nets: Can we generate a new dataset based on only one training set?
arXiv:2406.11794 [cs.LG] (Published 2024-06-17, updated 2024-06-18)
DataComp-LM: In search of the next generation of training sets for language models
Jeffrey Li et al.