Empirische Inferenz Article 2007

Mining complex genotypic features for predicting HIV-1 drug resistance

Human immunodeficiency virus type 1 (HIV-1) evolves in human body, and its exposure to a drug often causes mutations that enhance the resistance against the drug. To design an effective pharmacotherapy for an individual patient, it is important to accurately predict the drug resistance based on genotype data. Notably, the resistance is not just the simple sum of the effects of all mutations. Structural biological studies suggest that the association of mutations is crucial: Even if mutations A or B alone do not affect the resistance, a significant change might happen when the two mutations occur together. Linear regression methods cannot take the associations into account, while decision tree methods can reveal only limited associations. Kernel methods and neural networks implicitly use all possible associations for prediction, but cannot select salient associations explicitly. Our method, itemset boosting, performs linear regression in the complete space of power sets of mutations. It implements a forward feature selection procedure where, in each iteration, one mutation combination is found by an efficient branch-and-bound search. This method uses all possible combinations, and salient associations are explicitly shown. In experiments, our method worked particularly well for predicting the resistance of nucleotide reverse transcriptase inhibitors (NRTIs). Furthermore, it successfully recovered many mutation associations known in biological literature.

Author(s): Saigo, H. and Uno, T. and Tsuda, K.
Journal: Bioinformatics
Volume: 23
Number (issue): 18
Pages: 2455-2462
Year: 2007
Month: September
Day: 0
Bibtex Type: Article (article)
DOI: 10.1093/bioinformatics/btm353
Digital: 0
Electronic Archiving: grant_archive
Language: en
Organization: Max-Planck-Gesellschaft
School: Biologische Kybernetik
Links:

BibTex

@article{4603,
  title = {Mining complex genotypic features
  for predicting HIV-1 drug resistance},
  journal = {Bioinformatics},
  abstract = {Human immunodeficiency virus type 1 (HIV-1) evolves in human body,
  and its exposure to a drug often causes mutations that enhance
  the resistance against the drug.
  To design an effective pharmacotherapy for an individual patient,
  it is important to accurately predict the drug resistance
  based on genotype data.
  Notably, the resistance is not just
  the simple sum of the effects of all mutations.
  Structural biological studies suggest that
  the association of mutations is crucial:
  Even if mutations A or B alone do not affect the resistance,
  a significant change might happen
  when the two mutations occur together.
  Linear regression methods cannot take the associations into account,
  while decision tree methods can reveal only limited associations.
  Kernel methods and neural networks implicitly use all possible
  associations for prediction, but cannot select salient associations
  explicitly.
  Our method, itemset boosting, performs linear regression
  in the complete space of power sets of mutations.
  It implements a forward feature selection procedure where,
  in each iteration, one mutation combination is
  found by an efficient branch-and-bound search.
  This method uses all possible combinations,
  and salient associations are explicitly shown.
  In experiments, our method worked particularly well for predicting the
  resistance of  nucleotide reverse transcriptase inhibitors
  (NRTIs). Furthermore, it successfully recovered many mutation
  associations known in biological literature.},
  volume = {23},
  number = {18},
  pages = {2455-2462},
  organization = {Max-Planck-Gesellschaft},
  school = {Biologische Kybernetik},
  month = sep,
  year = {2007},
  slug = {4603},
  author = {Saigo, H. and Uno, T. and Tsuda, K.},
  month_numeric = {9}
}