Zum Hauptinhalt springen

The Bech-Rafaelsen Melancholia Scale (MES) in clinical trials of therapies in depressive disorders: a 20-year review of its use as outcome measure

Bech, Per
In: Acta Psychiatrica Scandinavica, Jg. 106 (2002-09-11), S. 252-264
Online unknown

The Bech–Rafaelsen Melancholia Scale (MES) in clinical trials of therapies in depressive disorders: a 20-year review of its use as outcome measure. 

Bech P. The Bech–Rafaelsen Melancholia Scale (MES) in clinical trials of therapies in depressive disorders A 20‐year review of its use as outcome measure. Acta Psychiatr Scand 2002: 106: 252–264. © Blackwell Munksgaard 2002. Objective: To evaluate the psychometric properties of the Bech–Rafaelsen Melancholia Scale (MES) by reviewing clinical trials in which it has been used as outcome measure. Method: The psychometric analysis included internal validity (total scores being a sufficient statistic), interobserver reliability, and external validity (responsiveness in short‐term trials and relapse prevention in long‐term trials). Results: The results showed that the MES is a unidimensional scale, indicating that the total score is a sufficient statistic. The interobserver reliability of the MES has been found adequate both in unipolar and bipolar depression. External validity including both relapse, response and recurrence indicated that the MES has a high responsiveness and sensitivity. Conclusion: The MES has been found a valid and reliable scale for the measurement of changes in depressive states during short‐term as well as long‐term treatment.

Keywords: rating scales; melancholia scale; validity; reliability; acute therapy; long‐term therapy

History of the Bech–Rafaelsen Melancholia Scale (MES)

Although the Bech–Rafaelsen Melancholia Scale (MES) ([1]) was published in the same year as the DSM‐III ([2]) the scale was constructed independently of the DSM‐III universe of symptoms defining the diagnosis of major depression.

Table 1 shows the DSM‐IV ([3]) symptoms included in the algorithm of major depression as well as the MES symptoms with their reference both to the Cronholm–Ottossons' Depression Scale ([4]) and to the Hamilton Depression Scale (HAM‐D) ([5]). For a comparison, the Montgomery–Åsberg Depression Scale (MADRS) ([6]) is also shown in Table 1.

The MES is based on the core items of depression as identified by Bech et al. ([7]) when making the first validation study of the HAM‐D, using a global clinical assessment of the severity of depression as the index of validity. The global assessment was made by experienced psychiatrists on a straight vertical line from 0=no depression to 10=maximum depression. The line was provided with numbers (1=doubtful depression, 2–4= mild depression, 5–7=moderate depression, and 8–10=severe depression). The results showed that this global assessment was associated statistically with only six of the 17 items included in the HAM‐D ([7]). These six items are shown in Table 1.

The MES is also based on the Cronholm–Ottosson Depression Scale [for a comprehensive overview, see Bech ([8])]. The first version of the Cronholm–Ottosson Depression Scale was published in the same year as the first version of the HAM‐D ([9], [10]), but differed from the HAM‐D in its construct. Thus, the Cronholm–Ottosson Depression Scale was designed to rate changes in depressive states during treatment, while the HAM‐D originally was designed for measuring the severity of clinical symptomatology of depressive states. As a consequence, the Cronholm–Ottosson Depression Scale included only nine items in its first version ([9]) and eight items in its final version (Table 1). The HAM‐D versions have even been enlarged from 17 to 28 items to cover the full picture of the depressive symptomatology ([11]).

When validating the Cronholm–Ottosson Depression Scale ([8], [9]) the index of validity was the global improvement assessed by an experienced psychiatrist (0=unimproved; 1=slight improvement; 2=marked improvement, and 3=full remission). Table 1 shows that from the Cronholm–Ottosson Depression Scale the MES, which is based on the HAM‐D6, has included the symptoms of suicidal thoughts, sleep disturbance, poor concentration and emotional retardation.

MADRS, like the Cronholm–Ottosson Depression Scale, was designed to rate changes during treatment. This scale includes most of the MES items, but lacks the important item of motor retardation which in the MES is separated into verbal retardation and motor retardation. On the other hand, the MADRS includes the item of reduced appetite.

In this review of the MES, the psychometric properties of the scale will be shown with special emphasis on its external validity in being sensitive to measuring changes in depressive states during treatment, not only in regard to responsiveness during short‐term antidepressive therapies, but also when recognising relapse during medium‐term treatment or recurrences of depression during long‐term therapy. Results both with the HAM‐D6 and the MES will be taken into consideration.

The psychometric principle, consisting of internal validity, reliability, and external validity, has been applied. Internal validity contains both content validity (the extent to which the MES covers the DSM‐IV symptoms of major depression) and construct validity (the extent to which the total score of the MES items is a sufficient measure of changes in depressive states). Reliability signifies the interrater reliability or degrees of interrater agreement. External validity covers both responsiveness (the ability of the MES to measure changes in the depressive patients during different antidepressive treatments) and sensitivity (the ability of the MES to discriminate between active therapy and placebo).

Internal validity

Internal validity covers both content and construct validity.

Content validity.

The internal validity of the MES is high as the scale includes all symptoms of major depression according to DSM‐IV, apart from appetite/weight (Table 1). Because the MES is a measure of change during treatment the item of appetite/weight has been excluded. Thus, in our studies with the HAM‐D17 in which both appetite and weight are individual items we found that these items are often indicators of side‐effects of therapy rather than indicators of changes in depressive states ([7]). When measuring changes during treatment with patients with seasonal affective disorders (K.P.J. Martiny, C. Simonsen, M. Lunde, personal communication) we found that the MES was more sensitive than the HAM‐D because these patients are 'atypical' in having an increased appetite and not a reduced appetite.

Table 1 shows also that the MES includes two items not to be found in DSM‐IV, namely psychic anxiety and emotional retardation. In studies with the HAM‐D the item of psychic anxiety has been very important. Thus, we ([12]) and van Praag ([13]) have found that the depressive states are linked to psychic anxiety.

In his last analysis of the HAM‐D, Hamilton ([14]) showed that both in females and males with depressive illness (melancholia) the items of depressed mood, lack of interests, tiredness and psychic anxiety are the most inclusive (core) symptoms.

Concerning the item of emotional retardation it has been found important, when the MES is used, together with the Bech–Rafaelsen Mania Scale (MAS) to cover bipolar states of mania and melancholia ([15][16][17]), ranging from the lack of contact with others to intrusiveness in social contacts.

Construct validity.

The assessment made by use of a rating scale is, as discussed by Hamilton ([18]), a skilled perception. Table 2 shows the levels of skilled perceptions as proposed by Dodwell ([19]) in which he refers to the Einstein statement: 'It is the theory which decides what we can observe'. When combining symptoms into states of depression, including changes in depression over time, the flow of information has both a passive and an active component in the skilled perception. The passive component deals with the perception of signals (symptoms) while the active component deals with the combination of signals (syndromes) into a diagnosis or into a dimension of change in the illness over time.

Information theory takes into account the amount of information that each symptom contains. Items with no amount of information have a redundancy in their message (no significant signal). Thus, a redundancy of 50% means that the average number of items per patient is only half of what it could be if all the items of the scale were transmitting significant clinical information. Information theory has several ways of measuring such redundancy. Miller ([20]) was the first to show that human observers are very limited when measuring information in terms of redundancy. They behave, as discussed by Dodwell ([19]), like limited‐capacity information channels. This was the background for Broadbent ([21]) to use algorithms to define mental states. DSM‐IV introduced algorithms for the diagnosis of mental states including major depression. The analysis of the HAM‐D with global ratings ([7]) was followed by a latent structure analysis to test the construct of the HAM‐D6 ([22]). It was shown that the HAM‐D6 was unidimensional (total score a sufficient statistic) which again was confirmed by Bech et al. ([23]) when evaluating the changes in depressive states during therapy with antidepressants.

Table 3 illustrates the principle of the latent structure analysis as found with the HAM‐D6 Miller ([20]), when establishing his information theory, found that the limited capacity for skilled perceptions was seven items plus or minus two. As one of the human constructs of the perspective of time he used the days of the week as an example. Latent structure analysis is a way to test the amount of information in the individual items. If they can be rank‐ordered as shown in Table 3, such as Tuesday is always closer to Monday than to Friday, the total score is a sufficient statistic of a scale.

Maier and Philipp ([24]) were the first to compare the HAM‐D17, MES and MADRS by latent structure analysis. In this study the MES, but not HAM‐D17 or the MADRS fulfilled the Rasch criteria. This was replicated by Maier et al. ([25]). The unidimensionality of the MES was also confirmed by Chambon et al. ([26]) by using the Rasch analysis in patients with DSM‐III major depression of an age between 18 and 80 years. However, the item of suicidal thoughts had small deviations from the unidimensional model in some of the subanalyses. Table 4 shows that Maier et al. ([25]) found that only HAM‐D6 and the MES were accepted by the latent structure (Rasch) analysis. The lack of undimensionality of MADRS has also been confirmed by Allerup ([27]) Therefore, the MADRS and HAM‐D17 were considered as multidimensional scales.

Table 5 shows the rank order of the MES scale in the French study ([28]). This structure has been compared with the MES‐correspondent items found sensitive to treatment by Nelson et al. ([29]).

Table 6 shows the item‐total score analysis of the MES ([30]) and the MADRS ([31]) which is another way to indicate that the items should be positively correlated with the total score of a scale if the total score is a sufficient statistic of severity ([32]). Davidson et al. ([31]) concluded that the MADRS seems not to be unidimensional, suicidal thoughts, reduced sleep, and appetite lead to lower coefficients.

Factor analyses of the MADRS and the MES have been very limited, probably because the use of factor analysis with the HAM‐D has illustrated that this method is very sensitive to the dispersion of scale scores in the sample of patients under examination ([22]). This has, for instance, been found for the MADRS by Galinowsky and Lehert ([33]) who showed that in depressed patients before therapy more than one factor was identified of which the general factor explained only 32% of the variance. However, after 4 weeks of therapy when the dispersion of the scale score was the largest, the general factor explained 66% of the variance. The only MES study on depressed patients comparable with the MADRS study ([33]) is the study by Chambon et al. ([28]) who examined depressed patients with the same age as included in the MADRS study ([33]), i.e. mean age approximately 42 years. In this MES study, however, the patients were only examined once, namely before therapy, and the first factor was a general factor explaining only 32% of the variance, i.e. of the same order as in the MADRS study ([33]).

The other MES studies with factor analysis have been carried out either on elderly subjects of whom only 20% were depressed ([34]) or on psychotic patients predominantly suffering from acute schizophrenia ([35]). In the latter study confirmatory factor analysis was used, identifying before therapy three factors which descriptively were called 'depressive core symptoms', 'retardation', and 'accessory depressive symptoms'.

Among latent structure analyses the Mokken analysis ([36]) should be considered a more useful test for unidimensionality than factor analysis within the frame of reference to correlation coefficients. Thus the Loevinger coefficient of homogeneity expresses the rank‐ordering of items in terms of transmitted information concerning the dimension under investigation ([37]). The coefficient of homogeneity in the Mokken analysis is the Loevinger coefficient. If the rank order is acceptable the Loevinger coefficient should be 0.40 or higher. For the MES a Loevinger coefficient of 0.47 was obtained while it was 0.30 for the HAM‐D17 and 0.44 for the HAM‐D6 ([38]).

The HAM‐D6 as well as the MES are scales measuring the severity of depressive states. They are not diagnostic (e.g. endogenous vs. reactive depression). In accordance with this, Thase et al. ([39]) found that an index of endogenous depression (the Hamilton Endogenomorphic Scale, HES), including items such as agitation, weight loss, and diurnal variation, correlated very poorly with the HAM‐D6 (0.33).

The interrater reliability

Interrater reliability is often shown as the intraclass coefficient when several raters are interviewing the same group of patients (joint ratings). Table 7 shows the interrater reliability as found by Maier et al. ([25]) when comparing the MES with the HAM‐D17, and the MADRS, both before treatment and after 3 weeks of therapy. In the DUAG (Danish University Antidepressant Group) studies the intraclass coefficient of the MES has typically been evaluated in depressed patients with a MES score of around 20 and an intraclass coefficient of 0.75 or higher have been obtained, equal to the coefficients found for the HAM‐D17 ([40]). In the first study on the interrater reliability of an MES the intraclass coefficient was 0.82 ([41]). Kørner et al. ([42]) found an interclass coefficient of 0.81, while Smolka and Stieglitz ([43]) found an intraclass coefficient of 0.87. In patients with depression comorbid with Parkinson's Disease, Årsland et al. ([44]) found an intraclass coefficient of 0.90, while Rossi et al. ([17]) found an intraclass coefficient of 0.92 in bipolar states.

In a judgement analysis study with the MES a Spearman coefficient of 0.98 was found when testing the test–retest reliability ([45]).

While only one version of the MES or the MADRS exist, [e.g. Schutte and Malouff ([46])], several different versions of the HAM‐D have been published over the years [e.g. Hooijer et al. ([47])]. Hamilton published two different versions of the HAM‐D ([5], [10]), of which the 1967 version is the one that is used most widely (e.g. the item of agitation measured on a Likert scale from 0 to 4, and not from 0 to 2). In drug trials the American version ([48]) is most frequently used internationally ([49]), probably because the American Food and Drug Association (FDA) prefers this version. However, Hamilton ([50]) never accepted this version. The version used in the DUAG (Danish University Antidepressant Group) ([51]) has been accepted by Hamilton ([12]) and has, as stated by Hooijer et al. ([47]), been considered as the standard version for Europe. Compared with the DUAG version, the total HAM‐D17 ([5]) was scored approximately one point higher in the study by Hooijer et al. ([47]). The American version includes more than 17 items and this often constitutes a problem when interpreting the severity of depression assessed with this version.

The external validity

Figure 1 shows the terminology of response, remission, relapse, and recovery in the treatment of a major depressive episode as outlined by Frank et al. ([52]) or Kupfer ([53]). Figure 1 is a modification of the version by Bech ([54]) in which 'early improvement 'is included and the percentage of change has a reference to the MES.

1.  The terminology of response, remission, relapse and recovery in accordance with Kupfer (52), but modified to include 'early improvement' and percentage of changes with reference to the Melancholia Scale (55).

The HAM‐D is the standard rating scale for measuring severity of depressive states. Paykel ([55]) has formulated the world‐wide familiarity with the HAM‐D: '...We all know the approximate meaning of a Hamilton score of 17, 13, or 26 ...' Over the years the HAM‐D scores have been associated with the following clinical concepts: major depression (a HAM‐D17 score of 18 or more), response (a 50% reduction of the pretreatment HAM‐D17 score), relapse or of recurrence (a HAM‐D17 of 18 or more). Kørner et al. ([42]) have shown that a MES score of 15 approximately equals a HAM‐D17 score of 18 and a MADRS score of 24.

Improvement and response in short‐term trials.

The first study comparing the sensitivity to change of the HAM‐D17, MES, and MADRS using the global clinical improvement after three weeks of therapy as index of validity ([25]) showed a coefficient of validity for the HAM‐D17 of 0.69, for the MES of 0.77, and for MADRS of 0.63. The difference between MES and MADRS was statistically significant ([25]).

Figure 1 shows the MES scores as typically seen in out‐patients with major depression, i.e. with a mean MES score before treatment of 20; an improvement score after 2 weeks of therapy of 15 (25%); and a response score after 4 weeks of therapy of 10 (50%). Only in patients with a response score of 10 or less it is meaningful to evaluate relapse during the continuation therapy. A relapse is defined as a MES score of 15 or more.

Table 8 shows the results with the MES in randomized clinical trials in samples of patients representing the whole spectrum of depression, from delusional depression ([56], [57]) over non‐delusional in‐patients with major depression ([51], [58]) to out‐patients with major depression ([59], [60]) and out‐patients with mixed minor/major depression ([61]). In these trials the serotonine‐specific reuptake inhibitors or SSRIs (paroxetine or citalopram) have been compared with tricyclics (imipramine, clomipramine, or maprotiline).

However, in the two trials on delusional depression the patients have been treated with a combination of ECT and imipramine or paroxetine ([56]), while amitriptyline has been combined with haloperidol when compared with risperidone alone ([57]).

Table 8 can be considered as the responsiveness of the MES in the spectrum of depressive states. In all the in‐patient trials (the four toplisted trials in Table 8) tricyclics are significantly superior to the SSRIs or risperdone after 4 weeks of therapy. In the DUAG ([58]) study the difference between clomipramine and paroxetine is already statistically significant after 2 weeks of therapy. This is in accordance with the meta‐analysis published by Anderson and Tomenson ([62]) or Anderson ([63]). In‐patients with major depression have a higher proportion of 'endogenous' or 'somatic' features. These endogenous features include items outside the universe of MES (such as distinct quality of depression, lack of reactivity, depression regularly worse in the morning, or significant anorexia and weight loss). When reviewing trials in which depressed patients have been subgrouped as having 'endogenous' features, Parker ([64]) showed that tricyclic antidepressants are superior to the SSRIs.

In such meta‐analyses the most frequently used statistic for outcome is the effect size which also has been used by Anderson ([63]). We have shown that effect size is a valid statistic when compared with other forms of statistics in meta‐analyses ([65]). The effect size is defined as the mean change between baseline and endpoint (typically 4–6 weeks of therapy) minus the mean change for the reference drug, divided by the pooled standard deviation of these differences.

As the HAM‐D17 has been used in over 95% of all controlled trials with SSRIs ([54]) it is the effect size of this scale that was used by Anderson and Tomenson ([62]) and Anderson ([63]), although the effect size is independent of the scale used in the trials. We found ([65]) that the HAM‐D6 had a higher effect size than HAM‐D17 when all placebo‐controlled trials with fluoxetine were analysed (0.38 vs. 0.30). Faries et al. ([66]) have shown that as a consequence of this difference in effect size it would require approximately one‐third fewer patients in studies using HAM‐D6 than in studies using HAM‐D17.

The effect size of 0.30 in a comparison of fluoxetine with placebo in patients with major depression ([65]) equals a response rate difference of between 15 and 20% on the HAM‐D17, the same magnitude as seen in other controlled trials with old or new antidepressants.

This limited difference in the efficacy of antidepressants compared with placebo in patients with mild to moderate degrees of depression has been discussed by Moncrieff ([67]). Among the problems Moncrieff raises is the use of HAM‐D17 as an outcome measure because this scale contains too many non‐specific or redundant depression items. Antidepressants with a strong antihistaminic effect, (e.g. amitriptyline or mirtazapine) are 'multidimensional' drugs as they have an impact on all the items of the 'multidimensional' HAM‐D17. In the out‐patient trials with mirtazapine compared with amitriptyline and placebo it has been found ([68]) that on the HAM‐D6 both drugs are superior to placebo, with effect sizes of 0.40 or higher.

In several studies, Angst and Stassen have shown that the delayed onset of action of antidepressants is 'something of a myth' ([69]). They introduced the concept of early improvement as a decrease in baseline rating scores of 20% or more, as the day to day fluctuations in the baselines HAM‐D17 scores, i.e. before treatment, are around 20%. Table 8 shows that the various types of antidepressive therapies have an early improvement of approximately 20% or more, but that statistical significance is first reached after 4 weeks of therapy, except for the DUAG ([58]) study.

Table 8 includes the study by Licht and Quitzau ([60]) in which patients with major depression in the private psychiatric practice setting were treated with a fixed dose of 50 mg daily of sertraline. Patients were then classified as responders or non‐responders after 6 weeks of therapy. The non‐responders went into a follow‐up trial over another 6 weeks. In total, 78% completed the 6 weeks open trial. Of these patients 24% were non‐responders. Already after 2 weeks of therapy the responders had improved significantly better than the non‐responders on the MES (Table 8).

The background for developing scales with a high degree of responsiveness is also to predict remission as early as possible during the first weeks of therapy. In the acute therapy of manic states it has recently been shown that the Bech–Rafaelsen Mania Scale (MAS) can predict remission after 11 days of therapy ([70]). This is in accordance with the overview of the psychometric properties of the MAS ([15]). In the acute therapy of depressive states the early improvement model (2 weeks) seems adequate as well (Table 8). Thus, using Kaplan–Meier curves, Angst and Stassen ([69]) have shown that the early onset of improvement after 2 weeks of therapy is highly predictive of later outcome. They have found that only around 20–25% of the patients are non‐responders. In regard to those patients who responded, Angst and Stassen ([69]) claimed that the effectiveness of antidepressants or placebo merely appears to trigger and maintain the conditions necessary for response.

Augmentation in non‐responders in short‐term trials.

In non‐responders to antidepressive medication, lithium augmentation was introduced by de Montigny et al. ([71]). They showed that the HAM‐D6 was more sensitive than the HAM‐D17 to measure response in therapy‐resistant patients with major depression. This effect of lithium has also been found for the SSRIs ([54]).

In the Standardized Stepwise Drug Treatment Regimen (SSTR) used since 1990 at the Department of Psychiatry at the Free University of Berlin ([72]) the algorithm for non‐response to treatment is a reduction of the MES scores by less than 25% after 2 weeks of therapy. In such cases the dose of the antidepressant should be increased. If the baseline MES has not been reduced by 50% or more after 4 weeks of therapy lithium is added. Bschor et al. ([73]) have analysed those depressed patients who in the period from 1990 to 1996 have received lithium augmentation according to the SSTR protocol ([72]). In total, 37 patients (or 52%) responded to the lithium augmentation which is equal to that found by Katona et al. ([74]) in a controlled clinical trial. Bschor et al. ([73]) demonstrated that the lithium responders were more severely depressed on the MES than the non‐responders.

Of the various augmentations (e.g. lithium, thyroid hormones, or pindolol) in therapy‐resistant depression lithium augmentation is still the most well‐documented medication ([75]).

As to the combination of antidepressants, Maes et al. ([76]) showed that the combination of fluoxetine and mianserin was superior to the combination of fluoxetine and pindolol. In a series of trials combining mianserin with imipramine ([77], [78]) or mianserin with fluoxetine ([79]) it has been shown that mianserin improves the efficacy of the imipramine or fluoxetine therapy. In these trials the MES has been found superior to HAM‐D17 in discriminating between the treatments.

Relapse prevention in medium‐term trials.

When reviewing the randomized clinical trials with antidepressants ([54]) it was demonstrated that there have been conducted very few trials in the continuation phase (Fig. 1), i.e. the medium‐term treatment following the initial, acute treatment phase. According to the European Guidelines for Clinical Investigations of Antidepressants ([80]) a substance is accepted as an antidepressant if it can be shown to be more effective than placebo, not only in the acute phase but also in the medium‐term (continuation) phase.

Moncrieff ([67]) has criticised the medium‐term or relapse prevention trials because many of them rely on a discontinuation design. In these studies, patients who have responded to treatment with antidepressants are randomized to continuing on the active drug or being withdrawn to an inert placebo. Table 9 shows the two trials in which the MES has been used to demonstrate either a relapse prevention after ECT or after light therapy. In both trials the criticism raised by Moncrieff ([67]) has, thus, been taken into consideration.

The percentage of relapses in depression over 6 months after ECT in patients with a full remission has been used to evaluate the prophylactic effect of lithium or imipramine ([81]). The first study to use this model for serotonin selective reupdate inhibitors was published by Lauritzen et al. ([56]) where paroxetine was compared with imipramine not only during the acute phase in combination with ECT (Table 8), but also in the continuation phase after ECT. In this study, a relapse was defined either as a score of 18 or more on HAM‐D17 or as a score of 15 or more on the MES. In Table 9 the study by Lauritzen et al. ([56]) has been reanalysed to show the percentage of relapse for the two scales separately. The results indicated that only for the MES the difference between paroxetine and imipramine was statistical significant (P < 0.05). However, for the HAM‐D17 the level of statistically significance in favour of paroxetine was 0.06.

This 'ECT model' for relapse prevention has been used by Martiny et al. (personal communication) to compare citalopram with placebo in patients with mild to moderate depression (seasonal affective disorder) who had remitted after 1 week of light therapy. The relapse criteria in this study on less severely depressed patients compared with the ECT study ([56]) were scores for the MES of 11 or more and for the HAM‐D17 of 13 or more. The sensitivity of the MES (Table 9) was higher than that of the HAM‐D17 when citalopram was compared with placebo.

Recurrence prevention in long‐term trials.

The analysis of the antidepressive properties of the SSRIs have mainly been made in non‐bipolar (unipolar) patients with recurrent major depressive episodes ([54]).

Post‐stroke depression.

The MES has been compared with HAM‐D17 in a study on the prevention of depressive states after stroke ([82]). All patients were followed for up to 1 year after the stroke. The criteria for depression were similar to the criteria used for the MES and the HAM‐D17, as shown in Table 9. Sertraline was found statistically superior to placebo on the MES after 16 weeks of therapy, while on the HAM‐D17 this difference first was seen after 21 weeks of therapy.

Unipolar patients.

In a double‐blind, placebo‐controlled study, the prophylactic effect of citalopram has been evaluated in elderly patients with recurrent major depression ([83]). This study was, in principle, a discontinuation study in which the patients before randomization to placebo in an open trial had responded to citalopram both in the acute treatment phase and in the continuation phase. In the maintenance phase the patients were followed over 52 weeks after the randomization. In this double‐blind maintenance part of the trial half of the patients received citalopram (n=60) and the other half placebo (n=61). The patients were evaluated every 4 weeks during the 52 weeks of therapy with the MES, MADRS, and the HAM‐D17.

Table 10 shows the results when the corresponding cut‐off scores were used for the three scales ([8], J. Bent‐Hansen, M. Lunde, K. Solstad, personal communication). The most clear discrimination between citalopram and placebo was seen for the MES showing that only 7% had a recurrence on citalopram against 38% on placebo. This is similar to the findings in Table 9 concerning the relapse‐prevention of the SSRIs.

Bipolar patients.

The long‐term mood‐stabilizing effect in bipolar manic‐melancholic patients has shown the superiority of lithium over anticonvulsants ([15]). However, only few trials have been published and in these trials the Bech–Rafaelsen Mania Scale (MAS) has most frequently been used in combination with the HAM‐D.

In a double‐blind trial over 2 years, lithium was compared with carbamazepine in bipolar patients and both the MES and the MAS scales were used as outcome measures ([84]). The results showed that lithium was superior to carbamazepine in preventing new episodes.

In another study both MES and MAS were used when comparing daily lithium dosing to lithium dosing every second day. The results showed that 50% developed new episodes of depression or mania in the every second day dosing regimen while only 10% developed new episodes in the once daily dosing ([85]).

Discussion

In their monograph on rating scales for the measuring of health and disease, McDowell and Newell ([86]) have concluded that '... compared with the Hamilton Scale the MADRS is easily administered and provides clearer guidelines to the rater. In turn, however, the Bech–Rafaelsen modification of the Hamilton Scale (MES) seems to offer further improvement...' This improvement in relation to the HAM‐D or the MADRS is, as discussed in the following, to be found in the psychometric triangle of validity, interrater reliability, and external validity (Fig. 2).

2.  Internal validity, reliability and external validity of the Bech–Rafaelsen Melancholia Scale, illustrated by the psychometric triangle.

Internal validity

The Bech–Rafaelsen Melancholia Scale (MES) has been shown to cover the same universe of symptoms as the DSM‐IV checklist for major depression. Symptoms with high diagnostic validity may, however, not necessarily have a high validity in measuring changes in severity of mental illnesses ([87], [88]). However, this review over empirical studies on the validity of the MES has shown that the 11 items constitute a unidimensional measure of depressive states. Both the Mokken analysis and the Rasch analysis have confirmed that the MES items have an internal hierarchy that is more consistent than the HAM‐D17 or the MADRS.

Interrater reliability

As discussed elsewhere ([89]) rating scales with a strong internal structure fulfilling the Rasch criteria for unidimensionality have also an adequate reliability. Empirical studies with the MES have shown that the interrater reliability is adequate with intraclass coefficients between 0.75 and 0.92, both in unipolar and bipolar patients. These coefficients are as high as those found for the DSM‐III or DSM‐IV diagnosis of major depression.

External validity

In this review the external validity of the MES has focused on early improvement, response and relapse or recurrence of major depressive episodes. When compared with the HAM‐D17 or the MADRS, the MES has been found superior especially in the trials on relapse and recurrence of major depressive episodes.

In the few studies with the MES on different biological markers of depression the scale has also been found superior to the HAM‐D17 ([91], [92]).

However, in studies on the plasma level vs. clinical response neither the MES nor the HAM‐D17 have show correlations of statistical significance ([51], [58], [93]).

As discussed by Hamilton ([18]) many scales have been developed to predict response to treatment. Such predictive scales have included what Hamilton refers to as 'fixed' items, for example, items such as family history, number of previous episodes, length of illness and types of personality. However, as stated by Hamilton ([18]) none of these scales have come into extensive use.

In a study on patients with chronic non‐malignant pain the MES has been found predictive for the outcome of antidepressants ([94]). Thus, a score of 15 or more was by Loldrup et al. ([94]) found to be associated with a high degree of remission with clomipramine when compared with placebo.

The MES trials included in the external validity analysis (Table 8) have not been able to conclude that the tricyclic antidepressants and the SSRIs are equally effective. In the acute therapy of depression the tricyclic antidepressants are superior to the SSRIs in hospitalized patients with severe degrees of depression. On the other hand, in the relapse prevention of patients who already have responded to ECT paroxetine was found superior to imipramine. This signifies that in the full‐blown, severe state of depression, the re‐establishment of the serotonin function is not enough, i.e. other neurotransmitters may be involved in the severe depressive states. However, after remission, in the relapse–prevention phase, the function of the SSRIs to maintain the serotonine balance seems sufficient. This theory is confirmed not only in the ECT study ([56]) but also in the light therapy study (K.P.J. Martiny, M. Lunde, C. Simonsen, personal communication) as well as in the recurrence‐prophylactic studies ([83] J. Bent‐Hansen, M. Lunde, K. Solstad, personal communication). The MES has been recommended among the clinician‐based scales to measure efficacy in the treatment of major depressive disorder ([95]).

The MES self‐rating version

A questionnaire to be completed by the patients themselves based on the HAM‐D17/MES version ([12]) has been developed by Bent‐Hansen et al. ([96]). They found a correlation coefficient of the MES self‐rating version of 0.68 when compared with the observer version of MES. This is within the limits of 0.40 and 0.75 often found when the Beck Depression Inventory (BDI) ([97]) or the Zung Self‐rating Depression Scale (SDS) ([98]) have been compared with HAM‐D17.

Conclusion

The MES is the first depression scale to fulfil the basic psychometric criteria of internal validity, reliability, and external validity. The MES covers approximately the DSM‐IV checklist of major symptoms of depression (content validity) and the scale is unidimensional, i.e. the total score is a sufficient statistic to assess severity of illness (construct validity). The interobserver reliability has been found to be acceptable in various settings.

The external validity of the MES has been tested both in regard to sensitivity and responsiveness. The sensitivity has been demonstrated in trials on the relapse prevention of depressive episodes in which a high MES discrimination was shown between active therapy and placebo when compared with other depression scales.

The responsiveness of the MES has been found high in short‐term trials in which the MES discrimination between TCAs and SSRIs was of statistical significance. Moreover, the improvement measured after 2 weeks of treatment was predictive for the final response.

References 1 BECH P, RAFAELSEN OJ. The use of rating scales exemplified by a comparison of the Hamilton and the Bech–Rafaelsen Melancholia Scale. Acta Psychiatr Scand 1980 ; 62 (Suppl. 285) : 128 – 132. 2 AMERICAN PSYCHIATRIC ASSOCIATION. Diagnostic and Statistical Manual of Mental Disorders, 3rd edn (DSM‐III). Washington DC: American Psychiatric Association, 1980. 3 AMERICAN PSYCHIATRIC ASSOCIATION. Diagnostic and Statistical Manual of Mental Disorders, 4th edn (DSM‐IV). Washington DC: American Psychiatric Association, 1994. 4 ÅSBERG M, KRAGH‐SORENSEN P, MINDHAM RH, TUCK JR. International reliability and communicability of a rating scale for depression. Psychol Med 1973 ; 3 : 458 – 465. 5 HAMILTON M. Development of a rating sale for primary depressive illness. Br J Soc Clin Psychol 1967 ; 6 : 278 – 296. 6 MONTGOMERY SA, ÅSBERG M. A new depression scale designed to be sensitive to change. Br J Psychiat 1979 ; 134 : 382 – 389. 7 BECH P, GRAM LF, DEIN E, JACOBSEN O, VITGER J, BOLWIG T. G Quantitative rating of depressive states. Acta Psychiatr Scand 1975 ; 51 : 161 – 170. 8 BECH P. Cronholm–Ottosson Depression Scale. The first specific depression scale for the rating of changes during treatment. Acta Psychiatr Scand 1991 ; 84 : 439 – 445. 9 OTTOSSON J‐O. Experimental studies of the mode of action of electroconvulsive treatment. Acta Psychiatr Scand 1960 ; 35 (Suppl. 145) : 69 – 97. HAMILTON M. A rating scale for depression. J Neurol Neurosurg Psychiatry 1960 ; 23 : 56 – 62. O'SULLIVAN RL, FAVA M, AUGUSTIN C, BAER L, ROSENBAUM JF. Sensitivity of the six‐item Hamilton Depression Rating Scale. Acta Psychiatr Scand 1997 ; 95 : 379 – 384. BECH P, KASTRUP M, RAFAELSEN OJ. Mini‐compendium of rating scales for states of anxiety, depression, mania, schizophrenia with corresponding DSM–III syndromes. Acta Psychiatr Scand Supplement 1986 ; 326 (73): 7 – 37. VAN PRAAG H. Anxiety/aggression‐driven depression. A paradigm of functionalization and verticalization of psychiatric diagnosis. Prog Neuropsychopharmacol Biol Psychiatry 2001 ; 25 : 893 – 924. HAMILTON M. The effect of treatment on the melancholias (depressions). Br J Psychiatry 1982 ; 140 : 223 – 230. BECH P. The Bech–Rafaelsen Mania Scale (MAS) in clinical trials for bipolar disorder. A twenty‐year review of its use as an outcome measure. CNS Drugs 2002 ; 16 : 47 – 63. BECH P, BAASTRUP PC, DE BLEEKER E, ROPERT R. Dimensionality, responsiveness and standardisation of the Bech–Rafaelsen Mania Scale in the ultra‐short therapy with antipsychotics in patients with severe manic episodes. Acta Psychiatr Scand 2001 ; 104 : 25 – 30. ROSSI A, DANELUZZO E, ARDUINI L et al. A factor analysis of signs and symptoms of the manic episode with Bech–Rafaelsen Mania and Melancholia Scales. J Affect Disord 2001 ; 64 : 267 – 270. HAMILTON M. The role of rating scales in psychiatry. Psychol Med 1976 ; 6 : 347 – 349. DODWELL P. Brave New mind. New York: Oxford University Press, 2000. MILLER GA. The magical number of seven, plus and minus two: some limits on our capacity for processing information. Psychol Rev 1956 ; 63 : 81 – 97. BROADBENT DE. Deception and Communication. New York: Pergamon, 1958. BECH P, ALLERUP P, GRAM LF et al. The Hamilton Depression Scale. Evaluation of objectivity using logistic models. Acta Pscychiatr Scand 1981 ; 63 : 290 – 299. BECH P, ALLERUP P, REISBY N, GRAM LF. Assessment of symptom change from improvement curves on the Hamilton Depression Scale in trials with antidepressants. Psychopharmacol 1984 ; 84 : 276 – 281. MAIER W, PHILLIP M. Comparative analysis of observer depression scales. Acta Psychiatr Scand 1985 ; 72 : 239 – 245. MAIER W, PHILIPP M, HEUSER I, SCHLEGEL S, BULLER R, WETZEL H. Improving depression severity assessment. Reliability, internal validity and sensitivity to change of three observer depression scales. J Psychiatr Res 1988 ; 22 : 3 – 12. CHAMBON O, CIALDELLA P, KISS L, PONCET F. Study of the unidimensionality of the Bech–Rafaelsen Melancholia Scale using Rasch analysis in a French sample of major depressive disorders. Pharmacopsychiatry 1990 ; 23 : 243 – 245. ALLERUP P. Statistical Analysis of Rating Scales. Copenhagen: Danish Institute of Educational Research, 1986. CHAMBON O, PONCET F, KISS L, MILANI D, COTTRAUX J. Adaptation française, validation concurrente et analyse factorielle de l'échelle de mélancolie de Bech et Rafaelsen. [French adaptation, concurrent validation and factorial analysis of the Bech and Rafaelsen melancholia scale]. Encéphale 1988 ; 14 : 443 – 448. NELSON JC, MAZURE C, QUINLAN DM, JATLOW PI. Drug‐responsive symptoms in melancholia. Arch Gen Psychiatry 1984 ; 41 : 663 – 668. RAFAELSEN OJ, BECH P, BOLWIG TG, KRAMP P, GJERRIS A. The Bech–Rafeaelsen combined rating scale for mania and melancholia. In: ACHETÉ K, AALBERG V, LÖNNQVIST J, eds. Psychiatria Fennica 1980 (Suppl.) :327–331 DAVIDSON J, TURNBALL CD, STRICKLAND R, MILLER G, GRAVES K. The Montgomery–Åsberg Depression Scale: reliability validity. Acta Psychiatr Scand 1986 ; 73 : 544 – 548. LIKERT R. A technique for measurement of attitudes. Arch Psychol 1932 ; 140 : 1 – 55. GALINOWSKI A, LEHERT P. Structurel validity of MADRS during antidepressant treatment. Int Clin Psychopharmacol 1995 ; 10 : 157 – 161. MARCOS T, SALAMERO M. Factor study of the Hamilton Rating Scale for Depression and the Bech Melancholia Scale. Acta Psychiatr Scand 1990 ; 82 : 178 – 181. MüLLER J, WETZEL H, SZEGEDI A et al. Three dimensions of depression in patients with acute psychotic disorders: a replication study. Compr Pychiatry 1999 ; 40 : 449 – 457. MOKKEN RJ. A Theory and Procedure of Scale Analysis. Paris: Mouton, 1971. DE JONG A, MOLENAAR IW. An application of Mokken's model for stochastic, cumulative scaling in psychiatric research. J Psychiat Res 1987 ; 2 : 137 – 149. BECH P, STAGE KB, NAIR NPV et al. The major depression rating scale (MDS). Interrater reliability and validity across different settings in randomized moclobemide trials. J Affect Disord 1997 ; 42 : 39 – 48. THASE ME, HERSEN M, BELLACK AS, HIMMELHOCH JM, KUPFER DJ. Validation of a Hamilton subscale for endogenomorphic depression. J Affect Disord 1983 ; 5 : 267 – 278. STAGE KB, BECH P, KRAGH‐SORENSEN P, NAIR NP, KATONA C. Differences in symptomatology and diagnostic profile in younger and elderly depressed inpatients. J Affect Disord 2001 ; 64 : 239 – 248. BECH P, GJERRIS A, ANDERSEN J et al. The melancholia scale and the Newcastle scales. Item‐combinations inter‐observer reliability. Br J Psychiatry 1983 ; 143 : 58 – 63. KøRNER A, NIELSEN BM, ESCHEN F. Quantifying depressive symptomatology: interrater reliability and inter‐item correlation. J Affect Disord 1990 ; 20 : 143 – 149. SMOLKA M, STIEGLITZ RD. On the validity of the Bech–Rafaelsen Melancholia Scale (BRMS). J Affect Disord 1999 ; 54 : 119 – 128. ÅRSLAND D, LARSEN JP, LIM NG, WERMUTH L, BECH P. α2‐Adrenoreceptor antagonism and serotonin reuptake inhibition in patients with Parkinson disease and depression. Nord J Psychiatry 2000 ; 54 : 411 – 415. BECH P, HAABER A, JOYCE CRB, DUAG. Experiments on clinical observation and judgement in the assessment of depression: profiled videotapes and judgement analysis. Psychol Med 1986 ; 16 : 873 – 883. SCHUTTE NS, MALOUFF JM. Sourcebook of Adult Assessment Strategies. New York: Plenum Press, 1995. HOOIJER C, ZITMAN FG, GRIEZ E, VAN TILBURG W, WILLEMSE A, DINKGREVE MA. The Hamilton Depression Rating Scale (HDRS): changes in scores as a function of training and version used. J Affect Disord 1991 ; 22 : 21 – 29. GUY W. Clinical Global Impressions. Early Clinical Drug Evaluation Unit (ECDEU) Rockville, MD: National Institute of Mental Health 1976 :218–222. ZITMAN FG, MENNEN MF, GRIEZ E, HOOIJER CI. The different versions of the Hamilton Depression Rating Scale. Psychopharmacol Ser 1990 ; 9 : 28 – 34. HAMILTON M, SHAPIRO CM. Depression. In: PECK DF, SHAPIRO CM, eds. Measuring Human Problems. Chichester: Wiley, 1990: pp 25–65. Danish University Antidepressant Group. (DUAG). Citalopram: clinical effect profile in comparison with clomipramine. A controlled multicenter study. Psychopharmacology 1986 ; 90 : 131 – 138. FRANK E, PRIEN RF, JARRETT RB. Conceptualization and rationale for consensus definitions of terms in major depression. Arch Gen Psychiatry 1991 ; 48 : 851 – 855. KUPFER DJ. Long‐term treatment of depression. J Clin Psychiatr 1991 ; 52 (Suppl. 5) : 28 – 34. BECH P. Pharmacological treatment of depressive disorders: a review. In: MAJ M, SARTORIUS N, eds. Depressive Disorders. WPA Series Evidence and Experience in Psychiatry. Chichester: John Wiley & Sons, 1999:89–127. PAYKEL ES. Use of the Hamilton Depression Scale in General Practice. In: BECH P, COPPEN A, eds. The Hamilton Scales. Berlin: Springer, 1990:40–47. LAURITZEN L, ODGAARD K, CLEMMESEN L et al. Relapse prevention of paroxetine in ECT treated patients with major depression. A comparison with imipramine and placebo in medium‐term continuation therapy. Acta Psychiatr Scand 1996 ; 94 : 241 – 251. MüLLER‐SIECHENEDER F, MüLLER MJ et al. Risperidone versus haloperidol and amitriptyline in the treatment of patients with a combined psychotic and depressive syndrome. J Clin Psychopharmacol 1998 ; 18 : 111 – 120. DANISH UNIVERSITY ANTIDEPRESSANT GROUP. Paroxetine: a selective serotonin reuptake inhibitor showing better tolerance, but weaker antidepressant effect than clomipra‐mine in a controlled multicenter study. J Affect Dis 1990 ; 18 : 289 – 299. ØHRBERG S, CHRISTIANSEN PE, SEVERIN B et al. Paroxetine and imipramine in the treatment of depressed patients in psychiatric practice. Acta Psychiatr Scand 1992 ; 86 : 437 – 444. LICHT RW, QUITZAU S. A placebo‐controlled study of treatment strategies in patients with major depression not responding to first line sertraline treatment: extended duration of treatment, dose increase or mianserin augmentation. Psychopharmacol 2002 ; 161 : 143 – 151. SZEGEDI A, WETZEL H, ANGERSBACH D et al. A double‐blind study comparing paroxetine and maprotiline in depressed outpatients. Pharmacopsychiatry 1997 ; 30 : 97 – 105. ANDERSON IM, TOMENSON BM. The efficacy of selective serotonin re‐uptake inhibitors in depression: a meta‐analysis of studies against tricyclic antidepressants. J Psychopharmacol 1994 ; 8 : 238 – 249. ANDERSON IM. SSRIs versus tricyclic antidepressants in depressed inpatients: a meta‐analysis of efficacy and tolerability. Depr Anx 1998 ; 7 (Suppl. 1) : 11 – 17. PARKER G. New and old antidepressants: all equal in the eyes of the lore. Br J Psychiatry 2001 ; 179 : 95 – 96. BECH P, CIALDELLA P, HAUGH M, BIRKETT MA. A meta‐analysis of randomised controlled trials of fluoxetine versus placebo and tricyclic antidepressants in the short‐term treatment of major depression. Br J Psychiatr 2000 ; 176 : 421 – 428. FARIES D, HERRERA J, RAYAMAJHI J, DEBROTA D, DEMITRACK M, POTTER WZ. The responsiveness of the Hamilton Depression Rating Scale. J Psychiatr Res 2000 ; 34 : 3 – 10. MONCRIEFF J. Are antidepressants overrated? A review of methodological problems in antidepressant trials. J Nerv Ment Dis 2001 ; 189 : 288 – 295. BECH P. Meta‐analysis of placebo‐controlled trials with mirtazapine using the core items of the Hamilton Depression Scale as evidence of a pure antidepressive effect in the short‐term treatment of major depression. Int J Neuropsychopharmacol 2001 ; 4 : 337 – 345. ANGST J, STASSEN HH. Do antidepressants really take several weeks to show effect? In: LEONARD BE, eds. Antidepressants. Basel: Birkhäuser, 2001: 21–30. FRANK E, KUPFER DJ, GEREBTZOFF A et al. The developement of study exit criteria for evaluating antimanic compounds. J Clin Psychiatry 2001 ; 62 : 421 – 425. DE MONTIGNY C, GRUNBERG F, MAYER A, DESCHENES JP. Lithium induces rapid relief of depression in tricyclic antidepressant drug non‐responders. Br J Psychiatry 1981 ; 138 : 252 – 256. LINDEN M, HELMCHEN H, MACKERT A, MULLER‐OERLINGHAUSEN B. Structure and feasibility of a standardized stepwise drug treatment regimen (SSTR) for depressed inpatients. Pharmacopsychiatry 1994 ; 27 (Suppl. 1) : 51 – 53. BSCHOR T, CANATA B, MULLER‐OERLINGHAUSEN B, BAUER M. Predictors of response to lithium augmentations in tricyclic antidepressant‐resistant depression. J Affect Disord 2001 ; 64 : 261 – 265. KATONA CL, ABOU‐SALEH MT, HARRISON DA et al. Placebo‐controlled trial of lithium augmentation of fluoxetine and lofepramine. Br J Psychiatry 1995 ; 166 : 80 – 86. PRICE LH, CARPENTER LL, RASMUSSEN SA. Drug combination strategies. In: AMSTERDAM, JD, HORNIG, M, NIERENBERG, AA, eds. Treatment‐Resistant Mood Disorders. Cambridge: Cambridge University Press, 2001: 194–238. MAES M, LIBBRECHT I, VAN HUNSEL F, CAMPENS D, MELTZER HY. Pindolol and mianserin augment the antidepressant activity of fluoxetine in hospitalized major depressed patients, including those with treatment resistance. J Clin Psychopharmacol 1999 ; 19 : 177 – 182. AURITZEN L, CLEMMESEN L, KLYSNER R et al. Combined treatment with imipramine and mianserin. A controlled pilot study. Pharmacopsychiatry 1992 ; 42 : 182 – 186. AURITZEN L, BJERG BENDSEN B, VILMER T, BJERG BENDSEN E, LUNDE M, BECH P. Post‐stroke depression: combined treatment with imipramine or desipramine and mianserin. A controlled clinical study. Psychopharmacology 1994 ; 114 : 119 – 122. AM J, RYDE L, SVEJSØ J, LANGE N, LAURITSEN B, BECH P. Morning fluoxetine plus evening mianserin versus morning fluoxetine plus evening placebo in the acute treatment of major depression. Pharmacopsychiatry 1998 ; 73 : 393 – 400. EUROPEAN UNION. European guidelines for clinical investigations of antidepressants. Eur Neuropsychopharmacol 1994 ; 4 : 62 – 65. SCHOU M. Prophylactic and maintenance therapy in recurrent affective disorders. In: GALLANT, DB, SIMPSON, GM, eds. Depression: Behavioural, Biochemical, Diagnostic and Treatment Concepts. New York: Spectrum Publications. 1976:309–326. RASMUSSEN A. Depression efter apoplexi (in Danish) [Depression after stroke]. PhD Thesis. Copenhagen: University of Copenhagen, 2000. KLYSNER R, HANSEN HL, BENT‐HANSEN J et al. The efficacy of citalopram in the prevention of recurrent depression in elderly patients: a placebo‐controlled study on maintenance therapy. Br J Psychiatry 2002 (in press). HARTONG E. Het profylactisch effect van lithium en carbamazepine bij patienten met een bipolaire stoornis (The prophylactic effect of lithium and carbamazepine in patients with bipolar disorder). Thesis. Nijmegen: University of Nijmegen, 2000. JENSEN HV, PLENGE P, MELLERUP ET et al. Lithium prophylaxis of manic depressive disorder: daily lithium dosing schedule versus every second day. Acta Psychiatr Scand 1995 ; 92 : 69 – 74. MCDOWELL I, NEWELL C. Measuring Health. A Guide to Rating Scales and Questionnaires. New York: Oxford University Press, 1987. FRANCES A, PINCUS HA, WIDIGER TA. DSB‐IV: work in progress. Am J Psychiatry 1990 ; 147 : 1439 – 1448. KESSLER RC, MROCZAK M. Measuring the effect of medical intervention. Med Care, 1995 ; 33 (Suppl. 4) : 109 – 119. BECH P. Rating Scales for Psychopathology, Health Status and Quality of Life. A Compendium on Documentation in Accordance with the DSM‐III‐R and WHO Systems. Berlin: Springer, 1993. SCHLEGEL S. Brain density in depression: methodological and psychopathological aspects. Acta Psychiatr Scand 1988 ; 78 : 610 – 612. SCHLEGEL S, MAIER W, PHILIPP M et al. Computer tomography in depression: Association between ventricular size and psychopathology. Psychiatry Res 1988 ; 28 : 221 – 230. SCHLEGEL S, NIEBER D, HERRMANN C, BAKAUSKI E. Latencies of the P 300 component of the auditory event‐related potential in depression are related to the Bech–Rafaelsen Melancholic Scale, but not to the Hamilton Rating Scale for depression. Acta Psychiatr Scand 1991 ; 83 : 438 – 440. BOURDEAUX R, PANNETIER P, YOUNOS C, DESOR D, LEHR PR, CAPOLACH B. Fluoxetine: relationship among plasma concentrations and therapeutic effects in the treatment of patients with major depression at 20 mg/day. L'Encéphale 1998 ; 24 : 57 – 61. LOLDRUP D, LANGEMARK M, HANSEN HJ. The validity of the Melancholic Scale (MES) in predicting outcome of antidepressants in chronic idiopathic pain disorders. Eur Psychiatry 1991 ; 6 : 119 – 125. BAUER M, WHYBROW PC, ANGST J et al. Guidelines for the treatment of unipolar depressive disorder. World J Psychiatry 2002 ; 3 : 3 – 43. BENT‐HANSEN J, LAURITZEN L, CLEMMESEN L, LUNDE M, KØRNER A. A definite and a semidefinite questionnaire version of the Hamilton/Melancholia (HDS/MES) Scale. J Affect Disord 1995 ; 33 : 143 – 150. BECK AT, WARD CH, MENDELSON M. An inventory for measuring depression. Arch Gen Psychiatry 1961 ; 4 : 561 – 571. ZUNG WWK. A self‐rating depression scale. Arch Gen Psychiatry 1965 ; 12 : 63 – 70.

By P. Bech

Reported by Author

Titel:
The Bech-Rafaelsen Melancholia Scale (MES) in clinical trials of therapies in depressive disorders: a 20-year review of its use as outcome measure
Autor/in / Beteiligte Person: Bech, Per
Link:
Zeitschrift: Acta Psychiatrica Scandinavica, Jg. 106 (2002-09-11), S. 252-264
Veröffentlichung: Wiley, 2002
Medientyp: unknown
ISSN: 0001-690X (print)
DOI: 10.1034/j.1600-0447.2002.01404.x
Schlagwort:
  • medicine.medical_specialty
  • animal structures
  • Psychometrics
  • Construct validity
  • Test validity
  • Relapse prevention
  • Clinical trial
  • External validity
  • Psychiatry and Mental health
  • Rating scale
  • medicine
  • Physical therapy
  • Internal validity
  • Psychiatry
  • Psychology
Sonstiges:
  • Nachgewiesen in: OpenAIRE
  • Rights: CLOSED

Klicken Sie ein Format an und speichern Sie dann die Daten oder geben Sie eine Empfänger-Adresse ein und lassen Sie sich per Email zusenden.

oder
oder

Wählen Sie das für Sie passende Zitationsformat und kopieren Sie es dann in die Zwischenablage, lassen es sich per Mail zusenden oder speichern es als PDF-Datei.

oder
oder

Bitte prüfen Sie, ob die Zitation formal korrekt ist, bevor Sie sie in einer Arbeit verwenden. Benutzen Sie gegebenenfalls den "Exportieren"-Dialog, wenn Sie ein Literaturverwaltungsprogramm verwenden und die Zitat-Angaben selbst formatieren wollen.

xs 0 - 576
sm 576 - 768
md 768 - 992
lg 992 - 1200
xl 1200 - 1366
xxl 1366 -