EFFICACY OF GINKGO BILOBA SPECIAL EXTRACTS — EVIDENCE FROM RANDOMIZED CLINICAL TRIALS
Since their introduction on the market in the mid 1960s, the use of phytopharmaceuticals based on Ginkgo biloba extract has become increasingly popular in several West-European countries and, more recently, in the USA as well. Positive clinical results of “Ginkgo” treatment have been claimed for patients suffering from a wide variety of health problems, such as peripheral arterial insufficiency, intermittent claudication, Raynaud’s syndrome, diabetic retinopathy, vertigo, tinnitus, cochlear deafness, senile macular degeneration, erectile dysfunction, cognitive impairment and dementia. In spite of the quite heterogeneous appearance of this list of medical conditions for which the use of Ginkgo preparations is sometimes indicated, most of them are thought to have a common, vascular, origin: an impaired blood supply of the affected organ or tissue. The aim of this chapter is to review the evidence regarding the clinical efficacy of Ginkgo biloba extract that is currently available in the scientific literature.
It was decided to restrict the scope of the chapter in several ways. Firstly, the review will deal exclusively with those intervention studies that have been carried out in human patients. It will not cover the evidence gathered through animal studies or in vitro investigations. In general, studies involving healthy volunteers will not be taken into consideration either. Secondly, only those studies will be addressed that focus predominantly on “clinical” endpoints, such as general well-being, clinical signs and symptoms, functional capacity (assessed either in daily life situations or under laboratory conditions), health complaints, etc. Studies with an exclusive focus on metabolic or physiological effects of Ginkgo biloba products and constituents will be left out from the review, as their primary interest is in the disclosure of the mechanisms of action underlying the clinical effects of Ginkgo, not in the clinical efficacy as such. Thirdly, the review concentrates on published evidence from investigations designed to exert a sufficient amount of control over potential sources of bias. This means that only the results of randomized controlled trials will be presented and discussed, and that the evidence based on, for instance, case series, open trials, and other types of design, will be neglected. A final restriction concerns the type of intervention: the review includes only studies conducted to assess the efficacy of a Ginkgo biloba special extract (GBSE). These special extracts contain fixed amounts of the substance classes that are held responsible for its clinical efficacy (usually: 24-25% Ginkgo flavonol glycosides and 6% terpene trilactones), obtained through a highly standardized and sophisticated process of extraction, purification and concentration.
The chapter is structured as follows. Firstly, results and experiences from previous Ginkgo reviews will be summarized briefly. Secondly, the strategy that was chosen for the current review will be explained. Subsequently, a systematic overview of the published intervention studies on the efficacy of Ginkgo biloba and their results will be presented. For this purpose the clinical trials identified were grouped into three main categories, that cover almost the entire domain of interest: trials on patients suffering from peripheral arterial disease; trials on patients with cerebral insufficiency and related disorders (including Alzheimer’s disease and vascular dementia); and a rest category, dealing with miscellaneous health problems (e.g. mountain sickness), that do not fit well in either the cerebral insufficiency or the peripheral arterial disease cluster.
PREVIOUS LITERATURE REVIEWS OF GINKGO BILOBA
Types of Literature Review
Several approaches can be chosen to review the published evidence for the efficacy of a medical intervention. Often a renowned subject matter expert is invited to formulate a summary judgment regarding the efficacy and the safety of the intervention. In general, such a narrative review bears the stamp of the authority and the personal preferences of the reviewer. It often lacks explicit statements regarding the searching strategy applied, the inclusion and exclusion criteria used to select studies, and the main arguments giving rise to the overall conclusion. In order to circumvent the limitations inherent to this classical approach of reviewing, more systematic and explicit approaches have been developed during the past two decades. The criteria-based systematic review and the quantitative meta-analysis are two modern types of systematic literature review that have become popular in the medical domain, not at least due to the worldwide promotional efforts of the Cochrane Collaboration.
The criteria-based systematic review — or qualitative review — stresses the need for a critical and structured methodological assessment of each study that qualifies for the review, preferably in a “blinded” fashion. Sometimes this assessment is backed up by a scoring system that is suited to value each of a series of relevant methodological quality aspects separately, and to calculate a summary quality score for the study at large. The idea behind this approach is that methodological quality should play a major role in weighing the evidence of each of the published studies that are available to base an overall judgement upon.
The quantitative meta-analysis honours the principle of statistical pooling of the results of several intervention trials addressing the same research question. Statistical pooling allows for an overall estimation of the size of the intervention effect which is more precise than the effect estimations for each of the studies separately.
Efforts to extract a valid summary judgment from the literature may be hampered in several ways. First of all, selective publication of empirical evidence and selective retrieval of published evidence may result in biased conclusions, irrespective of the type of systematic review. Heterogeneity as to methodological quality of a series of studies addressing the same research question, is a second obstacle. This problem is best addressed by means of a criteria-based systematic review. Heterogeneity of studies with regard to various design aspects that not necessarily affect the methodological quality — e.g. choice of study subjects, choice of type and duration of intervention, choice of outcome measures — can be regarded as a third sort of problem, which especially interferes with statistical pooling.
Ginkgo biloba Reviews
Several reviews assessing the overall effect of Ginkgo biloba in patients with various health problems have been published in the past. The majority of them can be classified as narrative reviews, although some tend to a systematic approach with a strong narrative component.
Kleijnen and Knipschild (1991,1992ab) published a criterion-based systematic review, dealing with both patients suffering from cerebral insufficiency and patients diagnosed with intermittent claudication. For the methodological assessment of the trials eligible for the review, they applied a checklist of 7 criteria, which referred to the adequacy of patient characteristics (10 points), number of patients analysed (30 points), the randomization procedure (20 points), the intervention procedure (5 points), double blinding (20 points), outcome measurement (10 points), and data presentation (5 points), respectively. Only 2 of the 15 trials that could be retrieved for the intermittent claudication review, were classified as “acceptable”, with a total quality score=65 points, viz. 75 and 66 points, respectively (Bauer, 1984; Saudreau et al, 1989). All studies, irrespective of their methodological quality, indicated a positive effect of Ginkgo, which was specified as statistically significant for 9 studies, and as a positive trend for 6 studies (positive effect, but not statistically significant for all outcome measures). Of the 40 studies included in the cerebral insufficiency review only 8 were considered to have an “acceptable” methodological quality, with a total score >65 points. No less than 39 studies indicated a positive effect of Ginkgo, which was found to be statistically significant for 26 studies, and a positive trend for 13. Therefore, study outcome appeared not to be clearly associated with methodological quality. From the score matrices an insufficient sample size, an inadequately described randomization procedure and a poor description of patient characteristics and outcome measures can be identified as the main methodological shortcomings.
Statistical pooling of the results of Ginkgo biloba trials is discouraged by the remarkable heterogeneity with regard to relevant design aspects. This heterogeneity concerns in particular the patient definitions used (various diagnostic screening tools, various criteria for inclusion and exclusion) and the outcome parameters applied (different levels of observation, various rating scales, psychometric tests and other outcome measures). The intervention procedures and contrasts tend to show more similarity. Schneider (1992) published a meta-analysis based on the pooled analysis (calculation of effect sizes) of 5 randomized controlled trials including patients with peripheral arterial disease. And Hopfenmuller (1994) pooled the results of several trials which focussed on patients with cerebral insufficiency. These quantitative meta-analyses illustrate that statistical pooling is feasible even for Ginkgo trials. However, this can only be attained at the cost of the exclusion of much valuable evidence embedded in trials that are regarded not admissible to the “pool”, due to unconciliatory design choices. For instance, while the review by Kleijnen and Knipschild (1992b) contained 40 “cerebral insufficiency” trials, only 11 were considered eligible for the quantitative meta-analysis conducted by Hopfenmuller (1994), which concentrated on the Ginkgo biloba special extract LI 1370 (Kaveri forte) and on a subjective, clinical symptom checklist as the main outcome measure. Eight trials qualified for the analyses based on the evaluation of single symptoms of cerebral insufficiency and the total score for clinical symptoms. Statistically significant effects were found for all symptoms separately and in combination (total score), except for one trial. This indicates the superiority of Ginkgo over placebo. After dichotomization of the total scores (“improved” vs. “not improved”) a pooled odds ratio of 2.0 (95% confidence interval: 1.4; 2.6) was calculated. Six trials could be used for the meta-analysis based on global assessment of the treatment effect, either by the physician or by the patient, as the main outcome measure. After dichotimization (“very good” or “good” vs. “moderate” or “unsatisfactory” or “no effect”) a pooled odds ratio of 2.9 (95% confidence interval: 2.1; 3.7) was calculated. These data suggest that the rate of success for Ginkgo treatment is about two to three times that for placebo treatment.
Actually, all reviews referred to above seem to agree upon the beneficial effect of Ginkgo in patients suffering from cerebral insufficiency or peripheral arterial disease.
DESIGN OF THE CURRENT REVIEW
STRATEGY FOR SEARCHING THE LITERATURE
STRUCTURE OF THE TABLES IN THE CURRENT REVIEW
CLINICAL EFFICACY OF GINKGO BILOBA IN PERIPHERAL ARTERIAL DISEASE
CLINICAL EFFICACY OF GINKGO BILOBA IN CEREBRAL INSUFFICIENCY
Introduction
Since 1975 a considerable number of intervention studies have been published aimed at building evidence for the efficacy of Ginkgo biloba special extract in subjects with various manifestations of cerebral insufficiency. Most of them were projected as a randomized clinical trial (RCT). However, in some cases the description of the design in the literature (random allocation procedure, blinded outcome measurement, etc.) does not make clear whether they indeed deserve this qualification. The current review discusses 55 studies. They tend to show more differences than similarities. The recent review by Letzel et al. (1996), regarding the nootropic effect of Ginkgo biloba in comparison with two other active substance classes (tacrine, nimodipine), illustrates some approaches to distinguish a limited number of more or less homogeneous subcategories of trials. It shows that reference can be made to various design aspects, such as the diagnostic inclusion criteria applied, the measures used to assess the severity of the disease, or the type of instrument used to assess the intervention outcome.
To structure the current overview an attempt was made to divide the Ginkgo trials into several categories based on the main treatment indication. The following categories were distinguished:
1. Studies focussing on patients with dementia, including Alzheimer’s disease, vascular dementia, and mixed type dementia
2. Studies focussing on non-demented patients with cognitive decline
3. Studies focussing on patients with cerebral insufficiency expressed primarily by physical, non-cognitive manifestations (tinnitus, vertigo, hearing loss, etc.)
4. Studies focussing on patients with a mixture of cognitive and non-cognitive symptoms, diagnosed and evaluated through a standard symptom checklist
5. Studies focussing on patients with affective manifestations of cerebral insufficiency, especially studies focussing on depressive mood as the leading symptom
6. Studies focussing on patients with cerebral insufficiency according to diagnostic information that does not permit further differentiation of the disease state (cerebral insufficiency, unspecified).
We are aware, however, that this classification system can be challenged and that part of the trials will not fit exactly in one of the categories. More than average attention will be paid to the first category of trials. They are of special interest, due to their focussed character — restriction of the patient definition to a more precise and homogeneous disease entity than is the case for the majority of cerebral insufficiency trials — in combination with the claim that Ginkgo biloba special extract might benefit patients with mild to moderate stages of both Alzheimer’s dementia and vascular dementia. Most of the trials in this category have been conducted and reported quite recently. Therefore, it will not be surprising that they surpass most of the previous trials as for methodological quality and other relevant design aspects.
SUMMARY OF STUDY CHARACTERISTICS AND STUDYRESULTS
MISCELLANEOUS CONDITIONS
CONCLUSIONS
After having seen all these trial results, the key question remains to be answered. Can clinical therapy with Ginkgo biloba be regarded effective? Or, stated differently, will the evidence that is currently available from clinical studies, encourage (relatives of) patients with complaints of either cerebral or peripheral arterial insufficiency to ask for Ginkgo preparations, or physicians to prescribe them, or health authorities to consider the licensing of Ginkgo-based drugs?
The evidence that is rooted in the controlled trials included in this review is rather convincing. In general, the results point to a significant effect of Ginkgo biloba treatment, both in patients with peripheral arterial disease and in patients with dementia and several related manifestations of cerebral insufficiency. Other suggested domains of intervention have not yet been explored to such a large extent that solid conclusions can be drawn already. The consistency between the results of studies addressing the same target health problem, is remarkable. However, in several trials the positive effect did not concern all levels of outcome measurement. Furthermore, it should be stressed that only a minority of the trials can stand the test of criticism posed by the currently accepted — general and disease-specific — guidelines and prescriptions for the design and conduct of clinical intervention trials. As a consequence, a substantial part of the available evidence is contributed by a relatively small proportion of all Ginkgo trials. Most of them were published quite recently. Although the time-related difference in design choices hampers any comparison, one cannot get away from the impression that, generally speaking, the effect sizes reported in earlier trials exceed those of more recent trials. On average, larger sample sizes were used in the later trials than in the earlier ones. Some of these trials had enough “power” to make even apparently non-impressive treatment effects statistically significant.
The clinical importance of the effects shown by the most recent, high-quality studies remains debatable. Such a discussion is seriously hampered by the high level of abstraction of the concepts that underlie many of the questionnaires, tests and rating scales which tend to be used as effect measures. The outcome of such a debate will be also influenced by the prevailing ideas regarding the treatability and reversibility of the health problems under consideration. Finally, it goes without saying that any conclusion rests on the assumption that the current view of the efficacy of Ginkgo biloba treatment is not seriously biased by any form of selective reporting and publication.