2024年5月4日发(作者:)
Bioinformatics Advance Access published September 6, 2005
BIOINFORMATICS
Computational methods for the design of effective therapies
against drug resistant HIV strains
Niko Beerenwinkel
1,*
, Tobias Sing
2
, Thomas Lengauer
2
, Jörg Rahnenführer
2
, Kirsten Roomp
2
,
Igor Savenkov
2
, Roman Fischer
3
, Daniel Hoffmann
3
, Joachim Selbig
4
, Klaus Korn
5
, Hauke
Walter
5
, Thomas Berg
6
, Patrick Braun
7
, Gerd Fätkenheuer
8
, Mark Oette
9
, Jürgen Rockstroh
10
,
Bernd Kupfer
11
, Rolf Kaiser
12
, Martin Däumer
12
Department of Mathematics, University of California, Berkeley, CA,
2
Max Planck Institute for Informatics, Saarbrücken,
Germany,
3
Center of Advanced European Studies and Research, Bonn, Germany,
4
Max Planck Institute of Molecular Plant
Physiology and University of Potsdam, Germany,
5
Institute of Clinical and Molecular Virology, University of Erlangen-
Nürnberg, Erlangen,, Germany,
6
Medical Laboratory, Berlin, Germany,
7
PZB, Aachen, Germany,
8
Department of Internal
Medicine, University of Cologne, Germany,
9
Department of Gastroenterology, University of Düsseldorf, Germany,
10
Department of Internal Medicine, University of Bonn, Germany,
11
Institute of Medical Microbiology and Immunology,
University of Bonn, Germany,
12
Institute of Virology, University of Cologne, Germany
Received on …; revised on …; accepted on …
Advance Access publication . . .
1
ABSTRACT
Motivation:The development of drug resistance is a major obstacle
to successful treatment of HIV infection. The extraordinary replica-
tion dynamics of HIV facilitates its escape from selective pressure
exerted by the human immune system and by combination drug
therapy. We have developed several computational methods whose
combined use can support the design of optimal antiretroviral thera-
pies based on viral genomic data.
1INTRODUCTION
Persons infected with human immunodeficiency virus type 1 (HIV-
1) are highly susceptible to develop the acquired immunodefi-
ciency syndrome (AIDS), a major global threat to human health.
HIV-1 is a retrovirus with a 9.2kbp genome coding for 15 viral
proteins. Currently, 19drugs targeting three distinct steps in the
viral replication cycle are available for antiretroviral therapy.
These drugs can be grouped into four different classes, according
to their target and mechanism of action. Nucleoside and nucleotide
analogues act as chain terminators in reverse transcription of RNA
to DNA. Non-nucleoside reverse transcriptase inhibitors bind to
and inhibit reverse transcriptase (RT), a viral enzyme that catalyzes
reverse transcription. Protease inhibitors target the HIV protease,
which is involved in maturation of released viral particles by cleav-
ing precursor proteins. Finally, entry inhibitors block the penetra-
tion of HIV virions into their target cells.
Cell entry is a complex process mediated by sequential interac-
tions of the viral proteins gp120 (envelope) and gp41 (transmem-
brane) with the cellular CD4 receptor and a coreceptor, usually
CCR5 or CXCR4, depending on the individual virion. Conse-
quently, different types of entry inhibitors have been proposed:
Fusion inhibitors prevent merging of viral and host cell membranes
by binding to the transmembrane protein gp41. In contrast, core-
*
To whom correspondence should be addressed.
ceptor antagonists bind to the host protein prior to membrane fu-
sion.
The available antiretroviral agents are applied in combination
therapies—so called highly active antiretroviral therapy (HAART),
typically comprising two nucleoside analogues and either a prote-
ase inhibitor or a non-nucleoside RT inhibitor. However, therapeu-
tic success, even of HAART, is limited. Antiretroviral therapy is
not able to eradicate HIV, and durable suppression of virus replica-
tion below detectable limits is achieved in only a fraction of pa-
tients. Drug resistance can be the cause of treatment failure and is
almost always a consequence of it (Clavel et al., 2004, DeGruttola
et al., 2000).
1.1Drug resistance
The intra-patient virus population is a highly dynamic system,
characterized by high virus production and turnover rates and a
high mutation rate. These evolutionary dynamics are the basis for a
large and diversified virus population that predisposes or quickly
generates resistance mutations. In a replicating population escape
mutants with a selective advantage under therapy become domi-
nant and lead to increased virus production and eventually to ther-
apy failure. A number of mutations in protease, RT, and gp41 have
been associated with resistance to different antiviral agents (Shafer
et al., 2000). Each drug has its own characteristic resistance profile
reflecting its chemical properties and mechanism of action. Never-
theless, cross-resistance (i.e. resistance against an unused drug) is
common between drugs from the same class. Therefore, HAART
advocates the use of two different drug classes in order to reduce
the likelihood of a mutant to resist all drugs in the combination and
to suppress viral replication more effectively (Jordan et al., 2004).
After treatment failure, the shifted population may be hit with a
new drug combination, but finding such a potent regimen is chal-
lenging. Cross-resistance severely limits the remaining treatment
options and the success of subsequent regimens is further impaired.
© The Author (2005). Published by Oxford University Press. All rights reserved. For Permissions, please email: sions@
The online version of this article has been published under an open access are entitled to use,reproduce,disseminate,or display the open access
version of this article for non-commercial purposes provided that:the original authorship is properly and fully attributed; the Journal and Oxford University Press
are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety
but only in part or as a derivative work this must be clearly indicated. For commercial re-use,please contact sions@
winkel et al.
The interplay between development of drug resistance and insuffi-
cient suppression of virus replication can eventually lead to situa-
tions in which the currently available drugs can no longer control
replication at all. In the United States, as many as 50% of patients
receiving HAART carry a virus that is resistant to at least one of
the approved drugs (Richman et al., 2004). Furthermore, transmis-
sion of drug resistant viruses is estimated to occur in about 15%of
persons newly diagnosed with HIV infection in the US (Bennett et
al., 2005).
Because cross-resistance is frequent, treatment changes cannot
be based on the assumption that the virus will remain susceptible
to the unused drugs. Therefore, resistance testing has become an
important diagnostic tool in the management of HIV-infections
(Perrin et al., 1998). Resistance testing can be performed either by
measuring viral activity in the presence and absence of a drug
(phenotypic resistance testing), or by sequencing the viral genes
coding for the drug targets (genotypic resistance testing). Geno-
typic assays are much faster and cheaper, but sequence data pro-
vide only indirect evidence of resistance.
The Arevir project is a collaborative effort between clinicians,
virologists, and computational biologists to exploit genotype data
from genotypic resistance tests for the individual selection of op-
timal drug combinations. We have developed several computa-
tional methods for the analysis of integrated genotypic, phenotypic
and clinical data. Our goal was to provide tools for supporting
personalized genotype-driven treatment decisions.
time. We have addressed the problem of estimating evolu-
tionary pathways from sequence data.
(4)Therapy optimization. Our ultimate goal is to determine
optimal drug combinations on the basis of genotypic in-
formation. For this task, we need to estimate the in vivo ef-
fect of a drug combination on a given viral genotype and to
identify the regimen that maximizes clinical response. In
addressing these problems we make use of both the in vitro
phenotype predictions and the estimated evolutionary
pathways.
For each of the four challenges we present computational ap-
proaches and indicate the biological or clinical impact. We show
how the developed tools can be linked together in order to support
the selection of effective therapies against drug resistant HIV
strains.
2DATA MANAGEMENT
In order to meet the data integration and management challenge we
have developed the Arevir database, a secure electronic platform
for collaborative research aimed at optimizing anti-HIV therapies.
This system is designed to facilitate data exchange, improve diag-
nostics, support medical decisions, and to provide the basis for data
analysis.
2.1Database schema
1.2Challenges
The following questions have been addressed and approaches to
their solutions will be described in the following sections.
(1)Data integration. A prerequisite for any attempt to use
genotypic data in a clinical setting is to provide this infor-
mation at the right time and place. Resistance testing is of-
ten performed in specialized virological labs separated
from the clinical department. Furthermore, most clinical
data management systems are not prepared to handle se-
quence data. Thus, our first task is to collect, organize, and
integrate all relevant patient data.
(2)Phenotype prediction from genotypes. The first step in
interpreting genotypic data is to understand the effect of
single mutations and to relate mutational patterns to the in
vitro phenotype. We have addressed predicting phenotypic
drug resistance from the viral drug targets as well as pre-
diction of coreceptor usage from gp120. Both models can
augment the cheaper and faster genotypic test with a pre-
diction of the phenotype, namely the susceptibility to each
of the drugs and the coreceptor in use, respectively. This
piece of information is important for the choice of therapy.
(3)Evolution of drug resistance. Understanding the muta-
tional pathways that lead to resistant strains is important
for two reasons. First, this knowledge allows for estimating
the distance of a virus population to escape from drug pres-
sure, a quantity referred to as the genetic barrier. Second,
the prediction of mutational pathways makes it possible to
design sequences of therapies rather than one regimen at a
In managing HIV-infected patients a number of different types of
data arise, including personal patient data, therapy histories, nu-
merous virologic, immunologic and other clinical test results de-
rived from patient samples from different tissues, and sequence
data, e.g. from genotypic resistance tests. Our database schema
captures these data types in different modules, consisting of a few
tables each (Beerenwinkel, 2004a).
There is an important relationship between sequences and thera-
pies via the drug targets. The compounds making up a combination
therapy target specific viral proteins. DNA segments coding for
these proteins are sequenced in order to gain information on the
level of resistance that has been developed by the virus. Thus,
given the values of clinical markers the data model allows for ask-
ing for outcomes of therapy types versus mutational patterns
within the drug targets. This is the central question of the Arevir
project. It will be revisited in a later section.
2.2Implementation
The data model has been implemented in the open source relational
database management system MySQL. A secured client/server
architecture allows for remote access to the centralized database.
Since sensitive patient data are involved, this setting needs to meet
the security demands imposed by state and national law. In addi-
tion, we have developed a web interface to the database for clini-
cians and virologists. For these users the appropriate view on the
data is through a single patient or a single patient sample. Thus,
treating physicians as well as lab personnel get access to an inte-
grated view onto all relevant data for one patient. For example,
they can evaluate a genotypic resistance test result in the context of
the patient’s medical history and current immunologic status.
Moreover, applying the developed computational tools yields phe-
notypic interpretations of the genotypes. As of 2005, the Arevir
2
发布评论