Skip to main content

Skarzynski Tinnitus Scale: validation of a brief and robust tool for assessing tinnitus in a clinical population



Many tinnitus scales are available, but all of them have certain limitations. The aim of the current study was to present a psychometric data of a new brief and reliable questionnaire that could be conveniently used for evaluating tinnitus complaint in adults (either with normal or impaired hearing)—Skarzynski Tinnitus Scale (STS).


The study included 125 participants with at least 1 month of tinnitus duration. All participants were asked to complete the STS, Tinnitus and Hearing Survey (THS), Tinnitus Functional Index (TFI), Tinnitus Handicap Inventory (THI), and Beck Depression Inventory. Psychometric properties of the new tool were tested using exploratory factor analysis (EFA), Pearson bivariate correlation with other tinnitus questionnaires, Pearson bivariate correlation with pure-tone audiometry, Cronbach’s alpha coefficient, limits of agreement, smallest detectable change, and floor and ceiling effects. Norms for tinnitus severity as measured by the STS are proposed.


As a whole, the STS has excellent reliability (ICC = 0.94) and good internal consistency (α = 0.91). The results of EFA and content analysis of wording of the items justified the three-factorial structure. The convergent validity was proven by a significant positive correlation with THI, TFI and THS Subscale A scores. Additionally, the authors proposed norms dividing the results into four tinnitus severity grades.


Statistical analysis shows that STS is a brief but robust tool well-suited to clinical practice. A feature of STS is that it takes into account the impact of tinnitus on the patient’s psychological (emotional, cognitive) and functional domains as well as their ability to cope with tinnitus-related distress.


Tinnitus is an auditory sensation generated by abnormal activation within the auditory system when no external sound is present [1]. It is commonly described by the sufferers as “ringing in the ears”, but it can take many forms, such as buzzing, hissing, chirping, and others. The prevalence of tinnitus is 4.4–15.1% in adults [2] and the number of tinnitus sufferers is significant not only among older adults but also children [3,4,5].

Tinnitus has a serious impact on everyday life, leading sometimes to poor psychological well-being, insomnia, difficulties in concentration, and others [6,7,8,9]. However, tinnitus is difficult to measure objectively because it is almost always a subjective phenomenon, and objective measures of tinnitus such as pitch or loudness only weakly correlate with the impact of tinnitus on various domains of life [8, 10,11,12]. For this reason, self-reported measures are widely used in clinical practice to quantify tinnitus severity.

Many tinnitus scales are available, but according to systematic reviews not all of them meet the criteria of good measures [13]. In Poland, cross-cultural adaptation and validation have been made for three tinnitus questionnaires: the Tinnitus and Hearing Survey (THS-POL), Tinnitus Handicap Inventory (THI-POL), and Tinnitus Functional Index (TFI-Pl) [14,15,16]; in the current literature these three have been found to be the most robust among the available tools [17]. However, all these questionnaires also have certain limitations.

The best psychometric properties in several Polish clinical patients were reported to be for the THS; however, its main aim is to differentiate bothersome tinnitus from hearing difficulties, and so its application in tinnitus diagnosis is limited. For the THI, the three-factor structure postulated by Newman et al. [18, 19] has not been confirmed in Polish patients [15], a finding in accordance with other reports [20]. Ultimately, the TFI is the product of a thorough and lengthy development and it is a very valuable assessment tool due to its multidimensionality as well as its ability to diagnose tinnitus in great detail. Notwithstanding, its eight-factor structure has not been fully confirmed [21], and likewise in Polish patients [16]. However, in our clinic many patients point out that some TFI items are not clear (for example, How depressed were you because of your tinnitus) and that the 10-point response scale (choosing one integer from a range 0–10, with definition only for the extreme points) is too wide and difficult to use. Difficulties have also been encountered while assessing tinnitus in patients with hearing problems, which constitute the majority in our tertiary center. From patient reports, it was often almost impossible to distinguish their hearing problems from their tinnitus problems, especially on the Auditory subscale (referring to questions such as ability to hear clearly, understanding people who are talking, or following conversations due to tinnitus) and the Quality of life subscale (questions such as enjoyment of life, social activities, and relationship disturbance due to tinnitus). These questions require a lot of explanation from the examiner, interfering with the study protocol (since the TFI should ideally be a self-reported measure) and this can potentially bias the results.

In clinical practice it was regularly noticed that many patients, despite reporting severe tinnitus, developed strategies to cope with the problem and minimize the impact of tinnitus on their daily life. Since every health complaint can be a source of distress [22], it is worth recording behavioral and cognitive efforts made by patients to manage the difficulties caused by their complaint, in this case tinnitus [23]. However, in the case of tinnitus, the role of particular coping strategies has not yet been firmly established [24]. For example, in a study conducted by Henry and Wilson [25], who compared two groups with self-reported high and low tinnitus distress, no difference was found in terms of coping strategies used by the participants or the benefits derived from the strategies. Additionally, a recent review on coping with tinnitus [26] concluded that although coping is a valuable factor in tinnitus research, there is a lack of specific tinnitus-coping questionnaires which have a solid structure in terms of isolating discrete factors.

Taking into consideration all the above-mentioned limitations of existing research tools, and the importance of evaluating tinnitus-coping strategies during clinical assessment, we decided to develop a new brief but solidly framed questionnaire that could be used in a busy clinic and which was convenient for adult tinnitus patients, either with normal hearing or with hearing impairment, to use. The following assumptions were made while developing the tool: (1) it should measure tinnitus severity and reflect the impact of tinnitus on everyday life; (2) it should be able to assess the efficacy of treatment; (3) it should allow the clinician to quickly and efficiently gauge the general coping difficulties of the patient (who might then be directed to other specialists such as counsellors and psychologists); (4) it should measure only tinnitus, not other hearing problems; (5) it should have appropriate psychometric properties with adequate validity and reliability. Furthermore, in our everyday work we felt the need to have a simple tool that could be used in a busy clinic and which was convenient for adults suffering tinnitus.

Development of the Skarzynski Tinnitus Scale

An initial draft of the new tool—the Skarzynski Tinnitus Scale (STS, named after the primary author)—consisted of 51 items generated by specialists working with tinnitus patients: a physician, psychologist, hearing aid specialist, and psychometrician. Based on a review of the specialist literature [13, 17, 27, 28], our own clinical experience, and analysis of available tools, the following domains were selected: emotional (negative feelings connected with tinnitus, e.g., anxiety, fear, annoyance), cognitive (intrusive thoughts), functional (impact on everyday life), and coping with tinnitus-related distress (efforts to reduce the negative effects of tinnitus). The domain concerning the impact of tinnitus on hearing was not taken into consideration because of the difficulties described earlier: tinnitus patients often suffer from hearing loss as well and so are unable to specify, for example, whether difficulties in understanding other people’s speech are due to hearing loss or tinnitus.

Seven experts were asked to evaluate the items (physician audiologists, psychologists, an audiophonologist, and hearing aid specialists working with tinnitus patients). They received the set of 51 items with the following instructions: Please assign a score from 1 to 5 to each question based on its usefulness in assessing treatment results in tinnitus patients: 1 for a completely inadequate question, 5 for a fully adequate one.

The criterion for selecting an item was a high average score given by the specialists (above 3.5) and an additional criterion was the absence of any extremely negative scores (completely inadequate). On this basis, the experimental version of the STS consisted of 33 items. A 5-degree scale of answers was established: never, hardly ever, sometimes, often, and always. Some items were reverse-scored to reduce acquiescence bias and extreme response bias. This version was completed by 44 patients and each of them was interviewed about the tool. They were asked if the questions were comprehensible and whether the scale of answers was easy to use. Many patients pointed out that the answers: hardly ever, sometimes, often, etc., were hard to interpret, and several patients found it difficult to understand some questions. Many patients complained about the large number of questions and some claimed to be fed up with them. They suggested the questionnaire should be shorter.

Patients’ opinions were essential. Since it is the patients who complete the questionnaires, our goal was to balance their expectations (concerning comprehensibility, number of items, and convenient way of completing it) with diagnostic requirements. The information acquired during the interviews and the results of the experimental version were instrumental in the final selection of items. The aim was to select up to 20 items with a coherent factor structure as well as good reliability. The most frequently used method to examine dimensionality of data is factor analysis. According to the guidelines given by de Vet [29], items that do not cohere with any other set of factors can be deleted, and Cronbach’s alpha can be used to reduce the number of items while maintaining an acceptable internal consistency. In this study an exploratory factor analysis (EFA) was performed for the set of 33 items to isolate a smaller set of items with a factor structure which would explain over 50% of the variance; at the same time the reliability was checked to ensure that Cronbach’s alpha was above 0.70. According to these criteria, 15 items were finally selected (see Additional file 1: Appendix S1). This final version was the subject of the core analyses presented below.



The study included 125 participants reporting tinnitus complaints who were consecutive patients attending the Audiology and Phoniatrics Clinic. The main eligibility criteria were age over 18 years, tinnitus of at least 1 month’s duration which lasted more than 5 min at a time, and a lack of mental disorders confirmed in the patient’s medical history. The study was conducted according to the World Medical Association Declaration of Helsinki and was approved by the Ethics Committee of the Institute of Physiology and Pathology of Hearing (Approval Number IFPS: KB/18/2017). Each participant gave written informed consent for participating in the study. Two persons did not complete the STS, so they were excluded from the analysis. Seventy persons completed the STS twice over a period of 3 days during their diagnostic evaluation for tinnitus. No therapeutic procedures were applied during the diagnostic process.

There were 53 women and 70 men in the group. The patients’ ages ranged from 22 to 81 years old (M = 50.55; SD = 13.13). Table 1 presents education and place of residence. The period of suffering from tinnitus varied from 1 month to 50 years, with an average of 6.06 years. Most frequently, the tinnitus was bilateral (48%), 38.2% reported unilateral tinnitus (left ear 26%, right ear 12.2%), and 12.2% perceived tinnitus as located in their head (1.6% of patients did not answer the question). In 85.4% of patients, tinnitus was continual, while 12.2% suffered from tinnitus periodically (2.4% missing data).

Table 1 Participants’ education and place of residence


All participants were asked to complete the Skarzynski Tinnitus Scale (STS), Beck Depression Inventory (BDI), Tinnitus and Hearing Survey (THS), Tinnitus Handicap Inventory (THI) and Tinnitus Functional Index (TFI). The authors used their own Polish adaptations of THS [14] and THI [15], the Polish version of TFI (as adapted by Wrzosek et al. [16] and purchased on the basis of an agreement between our institution and Oregon Health & Science University, the questionnaire’s rights holder), and the Polish version of BDI [30,31,32]. Every patient completed the questionnaires in the same sequence: STS, BDI, THS, THI, TFI. STS was filled first to eliminate the priming effect related to the possible impact of other questionnaires on completing the new tool. In this way, we tried to keep the measurement reliability as high as possible.

Tinnitus and Hearing Survey

The Tinnitus and Hearing Survey (THS), developed by Henry et al. [33] is a brief tool to determine how much of a patient’s complaint of tinnitus is due specifically to tinnitus (Subscale A) or to hearing problems (Subscale B). The subscale concerning hyperacusis was not used in the analysis. Although tinnitus and hyperacusis can be related phenomena, this construct was not of interest in the current study.

Tinnitus Handicap Inventory

The Tinnitus Handicap Inventory (THI) measures the effects of tinnitus on everyday functioning [18, 19]. Twenty-five items are rated on a 3-point scale (yes, no, sometimes). The higher the score is, the greater the impact on everyday function is. The subscales (functional, emotional, and catastrophic) were used to check the validity of STS.

Tinnitus Functional Index

The Tinnitus Functional Index (TFI) [34] provides a composite measure for evaluating the functional impact of tinnitus and takes into consideration a broad range of symptoms associated with tinnitus severity. The questionnaire has eight subscales: intrusiveness, sense of control, cognition, sleep, auditory, relaxation, quality of life, and emotional. Higher scores reflect greater negative impact on everyday functioning. Fackrell et al. [21] showed that TFI is appropriate for measuring intervention-related change.

Beck Depression Inventory

Beck Depression Inventory is a self-reporting, 21-item inventory used to assess symptoms of depression [30, 31]. Each of the 21 items is rated on a 0–3 point scale. The global score is the sum of all answers and a higher score indicates greater depressive symptoms.

Pure-Tone Audiometry

For all patients, hearing thresholds for air and bone conduction in the right and left ear were determined by an experienced technician at frequencies of 0.125, 0.25, 0.5, 1, 2, 4, and 8 kHz using pure-tone audiometry in a soundproof booth.

Statistical and psychometric analysis

Construct validity was assessed using EFA. It was performed to test factor structure and to assign items to appropriate factors. The factors were extracted using the principal axis method with oblimin oblique rotation (correlation between factors was assumed). The number of factors was decided by considering the cumulative variance explained (a criterion over 50%), eigenvalues (over 1 for each factor), a screen test, and interpretability. A minimum loading of 0.5 for each item was taken as threshold [29, 35].

Convergent validity was assessed using Pearson bivariate correlations with other tinnitus questionnaires. The predefined criterion for strength of association was a correlation between the global scores of STS and THI, and between STS and TFI, of above 0.7 [36]. At the same time, a correlation between the respective subscales of STS and of the other tools (e.g., functioning subscale of STS or functioning subscale of THI) also needed to be above 0.70.

Discriminant validity was assessed using Pearson bivariate correlations with pure-tone audiometry (PTA) results and THS Hearing. The criteria described by Fackrell et al. [13] were used. Weak (or at most moderate) correlations between STS and PTA and between STS and subscale B of THS were expected because hearing problems should not to be related to tinnitus. Additionally, the groups which, in theory, should differ were compared: it was presumed that patients able to cope with tinnitus distress would have lower scores on THI and TFI than patients who had difficulty coping with tinnitus distress. The hypothesis was tested using a t test for paired samples and with a statistical significance threshold p < 0.05.

Reproducibility was gauged in terms of internal consistency (Cronbach’s alpha), reliability (intraclass correlation coefficient), and agreement (limits of agreement and the smallest detectable change—SDC). According to the criterion described by Nunnally and Bernstein [35], internal consistency was considered good when Cronbach’s alpha was above 0.70. To measure the reproducibility of STS, intraclass correlation (ICC) was used with a positive rating above 0.70 [36]. Agreement was assessed by calculating the limits of agreement described by Bland and Altman [37] and 95% scores were expected to be within the identified agreement limits. The SDC was defined as a change beyond measurement error (outside the limits of agreement) in stable patients [29].

Responsiveness was assessed in terms of the number of items exhibiting floor and ceiling effects. These effects were considered to be absent if fewer than 15% of the respondents achieved the lowest possible score (for a floor effect) or highest possible score (for a ceiling effect) [36]. For clinical use, norms for severity of tinnitus as measured by STS were proposed. For statistical analysis, IBM SPSS Statistics and AMOS version 24 were used.


Descriptive statistics

The analysis began with calculating descriptive statistics (Table 2). The means (M) were about 2.0; the highest in the item was concerned with difficulty in sleeping (M = 2.43), and the lowest mean in item 9 was concerned with coping with tinnitus by distracting attention (M = 1.43). The skewness was lower than 1.0, and kurtosis was generally lower than 1.0 too, but in a few cases it was slightly over 1.0.

Table 2 Descriptive statistics of items on the Skarzynski Tinnitus Scale (STS)

Discriminating power of each item

The discriminating power of an item refers to the degree with which it can distinguish between subjects with a high level of a trait and subjects with a low level. This property is related directly to how well the score measures a trait [38, 39]. A corrected item total correlation of more than 0.30 was an acceptable level of item discrimination [35]. Correlations are presented in Table 3. Discriminating power was good for nearly all items, except item 9. This item, concerning coping with tinnitus by distracting attention, not only had the lowest mean but also a very low discriminating power, less than 0.3.

Table 3 Corrected item total correlations

Construct validity

Exploratory factor analysis

Exploratory factor analysis was performed to reveal factor structure. The factors were extracted using the principal axis method with oblimin oblique rotation (a correlation between factors was assumed). The number of factors was decided after consideration of the cumulative variance explained, eigenvalues, scree test, and interpretability.

Exploratory factor analysis was performed twice. The first time all items were taken into account. The KMO measure of sampling adequacy was 0.91, and Bartlett’s test of sphericity was significant (χ2(105) = 987.77; p < 0.001). The three-factor solution explained 63.31% of the variance. The first factor explained 46.99% of variance, the second 9.25%, and the third 7.07%. But, this solution was not satisfactory because of item 9, for which communality was very low (0.15) and item 9 did not load on the same factor as other items concerned with coping. Additionally, internal consistency for the potential “coping” factor was lowered by item 9.

In this situation, EFA was performed again, this time with item 9 excluded. Once more, three factors were extracted. The three-factor solution explained 65.9% of variance. The first factor explained 50.28% of variance, the second 8.66%, and the third 6.95%. For two factors, eigenvalues were bigger than 1; for one factor it was 0.97. Table 4 presents factor loadings.

Table 4 Factor loadings (pattern matrix)

The factorial structure was now clear. Nearly all items were clearly assigned with the exception of item 1, which loaded two factors in a comparable manner. This item concerned emotions (irritation), and it was therefore included in another factor representing items concerned with emotions arising from tinnitus. Table 5 presents correlations between the factors.

Table 5 Factor correlation matrix

Factor 1, including items concerning functioning in everyday life, and factor 3, including items concerning emotions and thoughts, were more strongly correlated with each other than with factor 2 concerning coping, and this additionally confirms content validity. The results of EFA and content analysis of wording of the items justified the following assignment:

Psychological distress subscale: items 1, 4, 7, 8, 10, 14;

functional subscale: items 2, 5, 11, 13, 15;

coping subscale: items 3*, 6*, 12* (*items recoded).

Subscales and global scores

The subscale scores were calculated by summing up the answers to individual questions (0, definitely not; 1, rather not; 2, neither yes nor no; 3, rather yes; 4, definitely yes). The sum was then divided by the maximum score which was theoretically possible to obtain. The resulting scores were on a scale from 0 to 100, where 0 meant no difficulties in a given domain. The total score was calculated by summing up answers from all items and dividing the score by 56 (the maximum possible score). The way of calculating the subscale scores is set out below.

Psychological distress subscale:

((item 1 + item 4 + item 7 + item 8 + item 10 + item 14)/24)*100.

Functional subscale:

((item 2 + item 5 + item 11 + item 13 + item 15)/20)*100.

Coping subscale:

((item 3 + item 6 + item 12)/12)*100. These items, as mentioned before, should be recoded, i.e. 0, definitely yes; 1, rather yes; 2, neither yes nor no; 3, rather not; 4, definitely not.

In Table 6, descriptive statistics of the subscale scores and the global STS score are presented. The mean scores are around 50 points, and patients achieved the highest scores on the Psychological distress subscale, which indicates that the biggest impact of tinnitus is to provoke negative emotions and intrusive thoughts. In Table 7, correlations between scores on the subscales are presented. Correlations between the Psychological distress subscale and the Functional subscale were strong, whereas both subscales were moderately correlated with the Coping subscale. The contribution of the Psychological distress subscale and the Functional subscale to global scores was the highest, while the contribution of the Coping subscale was lower.

Table 6 Descriptive statistics of subscale scores and global STS scores
Table 7 Correlations between STS subscales


Internal consistency

Internal consistency is a measure of the extent to which items in a questionnaire subscale are homogeneous (i.e., correlated), and so tend to measure the same concept. Cronbach’s alpha was calculated for each subscale of the STS separately and for the total score. For the Psychological distress subscale α = 0.91, for the Functional subscale α = 0.84, for the Coping subscale α = 0.62, and for the STS global α = 0.91.

The STS global and Psychological distress subscale had extremely high consistency. The Functional subscale had good internal consistency according to the criterion given by Fackrell et al. [13]. Only the Coping subscale had lower, questionable internal consistency. The internal structure was also analyzed using inter-item correlations (Table 8). All correlations (with the exception of two results for item 5) were statistically significant and positive. Correlations between items forming separate subscales were from moderate to strong.

Table 8 Inter-item correlations


Reliability is a measure of the degree to which subjects can be distinguished from each other based on two testing sessions (test–retest). This property was assessed using intraclass correlation coefficient (ICC), with scores > 0.70 indicating high reliability [36]. ICC was as follows (with 95% CI): for the Psychological distress subscale ICC = 0.93 (0.88–0.96); for the Functional subscale ICC = 0.93 (0.89–0.96); for the Coping subscale ICC = 0.81 (0.69–0.88); and for the STS global ICC = 0.94 (0.90–0.97). As a whole, the STS has excellent reliability similar to Psychological distress and Functional subscales, and the reliability of the Coping subscale was also good.


Agreement relates to absolute measurement error and indicates which scores, on repeated measurement, are close to each other [36]. Agreement was assessed using two methods: the limits of agreement [37] and the SDC. Limits of agreements were calculated as: d ± 1.96* SDdiff, where d is the mean difference between test and retest, and SDdiff is the mean difference of the standard deviations.

The SDC was derived from the standard error of measurement (SEM) between two repeated measures: \({\text{SEM }} = {\text{ SD}}_{\text{diff}} /\sqrt 2\), so that \({\text{SDC}} = 1.96*/\sqrt 2 *{\text{SEM}}\) [29, 36]. Table 9 presents data concerning reliability and agreement between two repeated measures. The SDC scores are different than the limits of agreement, because they are based on SEM consistency, not SEM agreement. For the STS global scores, SDC was 18.43 and was higher than the limit of agreement, which was 14.79.

Table 9 Reproducibility of STS

A little under 95% of the scores for individual subscales were within the identified agreement limits. For STS global scores, there was 95% agreement between scores, as shown in Fig. 1.

Fig. 1
figure 1

Bland–Altman plot of test–retest agreement of STS global scores


Validity was examined by the degree of correlation with other tinnitus questionnaires and by comparing groups expected to differ due to known characteristics. Convergent validity was assessed as Pearson bivariate correlations. The THS (Subscale Tinnitus), the THI, and the TFI measure a similar construct, so strong—or at least moderate—correlations were expected (Table 10).

Table 10 Correlations between STS, THS (Subscale Tinnitus), and THI scores

There was strong and positive correlations between scores on the Psychological distress subscale and the THI Emotional scale, scores on the Functional subscale of STS and the THI Functional scale, and global scores. Also, there was a moderate and positive correlation between THS Hearing and STS global; here, a weaker correlation had been expected because of the low specificity of THS (screening instead of diagnosing specific domains). The weakest correlations occurred between the Coping subscale scores; this is expected since coping is a distinct construct (nevertheless, it should be emphasized that the correlations were significant and positive, and so bigger difficulties in coping with tinnitus were associated with higher THI scores). Table 11 presents correlations between the STS and the TFI scores.

Table 11 Correlations between the STS and TFI scores

There was strong correlations between scores on the Psychological distress subscale and the TFI Emotional and Cognition scales, and a strong correlation between scores on the Functional subscale STS and the TFI Relaxation and Sleep. There was a moderate correlation between the Coping subscale and the Sense of control subscale, although a stronger correlation was expected. Overall, scores were strongly and positively correlated.

Additionally, validity of the Coping subscale was examined by comparing groups that were expected to differ. The groups were selected on the basis of their median scores on the Coping subscale (Me = 41.67). Patients whose scores were lower than the median were assumed to be coping with tinnitus distress, while those whose scores were higher than the median were assumed to be having difficulties in coping with tinnitus distress. Then, the severity of tinnitus, as measured by the TFI and the THI, was compared (Table 12).

Table 12 TFI and THI scores for difficulties in coping with tinnitus distress

The differences between subjects with good coping and poor coping with tinnitus distress were statistically significant. In accordance with our presumption, higher tinnitus severity (measured by both TFI and THI) was indicated in patients who had big difficulties in coping with distress due to tinnitus. This shows that the Coping subscale effectively measures a patient’s ability to cope with tinnitus distress. For discriminant validity, the correlations between the STS and the BDI were assessed (Table 13).

Table 13 Correlations between STS and BDI scores

Correlations between the STS and the BDI scores were moderate, while for STS coping correlation was weak. This shows that the STS measures a construct which is distinct from depression symptoms. Correlation between the STS global score and the BDI was similar to correlation between the TFI global score and the BDI (r = 0.57) reported by Fackrell et al. [21].

Discriminant validity was also examined by looking at correlations between the THS (Hearing subscale) and PTA in terms of average thresholds for the right and left ears (Table 14). There were moderate correlations between STS and THS Hearing. At the same time, there were no correlations between STS and PTA. This shows how difficult it is for patients to distinguish between complaints related to tinnitus and complaints related to hearing loss.

Table 14 Correlations between STS and PTA and THS Hearing


Responsiveness refers to how well the tool is able to detect major changes and it was assessed in terms of the number of items exhibiting floor or ceiling effects.

A floor or ceiling effect is considered to be present if more than 15% of respondents achieved the lowest or highest possible score [36]. To reveal it, frequency distributions of responses for each of the STS items were examined (Table 15). For the majority of items, the responses were rather uniformly distributed. Only for three items (numbers 4, 5, 14), the fraction of lowest possible responses exceeded 15%. Likewise, only for three items (numbers 1, 11, 13) a ceiling effect might be present, with more than 15% choosing the highest possible response.

Table 15 Percentage of responses for the STS items

It also appears that the distribution of scores for subscales and total scores is as significant as the distribution of answers to individual items. Especially in the case of total scores, there is no clear sign of truncated tails (Fig. 2).

Fig. 2
figure 2

Distributions of STS scores


It is very important, especially in clinical practice, to have the opportunity to assign qualitative meanings to the quantitative scores. Both clinicians and researchers want to identify and quantify the severity of tinnitus. For this reason, an attempt was made to define categories and develop a grading system of STS scores indicating various levels of tinnitus severity.

The distribution of global scores was approximately normal (Kolmogorov–Smirnov test: Z (df = 123) = 0.08; p > 0.05) with mean M = 51.18 and SD = 21.18. Next, the transformation of raw scores into Z-scores was made using the formula Z = (XM)/SD, where X is the raw score, Z is the standardized score, M is the mean, and SD is the standard deviation. In accordance with a normal distribution, the majority of scores (68%) were in the range − 1 to + 1 Z-score. Values for − 1 Z and + 1 Z were treated as the limits of the norm, so that:

$$\left( {X{-} 5 1. 1 8} \right)/ 2 1. 1 8 { } = { 1},{\text{ so}}{:} \, + 1 Z \, = { 72}. 3 6 { } \approx { 72}.$$
$$\left( {X{-} 5 1. 1 8} \right)/ 2 1. 1 8 { } = \, {-} 1,{\text{ so}}{:} \, {-} 1 Z \, = { 3}0.$$
  1. 1.

    Scores below 30 can be considered low and indicate mild tinnitus severity and had a slight impact on everyday functioning.

  2. 2.

    Scores between 30 and 51 can be considered moderate and indicate moderate tinnitus severity and there is a noticeable negative impact of tinnitus on everyday functioning.

  3. 3.

    Scores between 51 and 72 can be considered high and indicate escalated tinnitus severity and there is a considerable negative impact of tinnitus on emotional, cognitive, and functional difficulties. They indicate a poorer ability to cope with tinnitus distress.

  4. 4.

    Scores above 72 (extreme) indicate very high tinnitus severity, a high intensity of negative emotions connected with tinnitus, the occurrence of intrusive thoughts, sleep disturbance, and a handicapped ability to cope with tinnitus distress.

These norms may be very useful in clinical practice to define the current condition of the patient, as a benchmark for treatment progress, and to make empirically based clinical decisions such as starting, continuing, or ending treatment.


The assumption while developing the STS questionnaire was to ensure it would be a convenient tool for patients and, at the same time, would enable physicians to assess the impact of tinnitus on the most important domains of a patient’s functioning. From interviews with patients, we concluded that these domains included emotions, thoughts, and everyday activities. Many patients also emphasized that despite suffering from tinnitus they were able to cope with it.

Factor analysis confirmed the existence of three factors corresponding with the above-mentioned domains. This appears to make sense, since emotions, thoughts, and behaviors constitute a general construct, called attitude by psychologists, whereas the perception of illness and its consequences experienced by the patient affect the way in which the patient copes with their condition.

The participants filled in the STS twice, over a period of just 3 days; such a short time frame results from economic reasons (a 3-day diagnostic hospitalization is done under contract to, and paid by, the Polish National Health Insurance). However, Marx et al. [40] found no difference in the stability of results from a 2-day and 2-week test–retest, so applying a 3-day time frame appears justified in terms other than clinical practicality.

Internal consistency of the whole STS was very high, as was its two subscales of Psychological distress and Functional (unlike the Coping subscale). Although some researchers assume that acceptable internal consistency need only to be higher than 0.6 [41], a 0.7 threshold appears more reasonable. The experience acquired while developing the tool clearly indicates that the issue of coping with tinnitus-related distress is complex and requires a more thorough examination, with bigger contributions from psychologists and therapists. The strategies of coping with stress are varied—from active to passive, from a focus on the problem to a focus on emotions, and so on [25, 42]. This issue should be the subject of separate investigations which might lead to the creation of a separate, specialized tool.

Convergent validity of the STS was verified by its correlation with the most commonly used questionnaires—the THI and the TFI. Significantly, no correlation between the STS scores and pure-tone audiometry scores was found, which confirms that difficulties connected with hearing and with tinnitus are different in nature and that the decision not to include the hearing domain in the tool was correct. At the same time, the Coping subscale proved to be meaningful since, clearly, patients who found it hard to cope with tinnitus-related distress showed higher severity of tinnitus (as measured by the TFI and the THI).

Responsiveness was good. Only on three items a minor floor effect appeared (where a little over 15% of respondents achieved the lowest possible score). On three items, a ceiling effect was visible (over 15% achieved the highest possible score). However, for subscale scores, as well as for global scores, floor and ceiling effects did not appear, and so these scores can be treated as sensitive to change.

A great asset of the STS is the fact that it can be used to detect treatment-related changes. For global scores, the SDC was 18.48, which is a value which can be confidently used to reflect real changes (i.e., not due to measurement error). The norms suggested for the STS may turn out to be very useful in clinical practice. The ability to define a patient’s score as low, moderate, high, or extreme is important and allows a diagnostician or a therapist to decide about the manner and scope of treatment.

A limitation of this study is a slight inconsistency in the content of the Psychological distress subscale. During the development of the STS, its three-factor structure was confirmed. The factor which was later labeled the Psychological distress subscale consisted mostly of emotional items, with two additional cognitive items which seem distinct from the others. However, the authors decided to include these cognitive items in this subscale on the basis of item loadings and intracorrelations between items (0.5 and more), but also according to the findings of psychology which show that negative affect and intrusive thoughts are positively associated [43, 44].

For future research, it would be worth defining a minimal important change (MIC) based not just on the score distribution but on an external criterion (in an anchor-based approach) to check if STS is responsive to treatment-related change. It is also desirable to conduct cross-validation in other groups of patients and test the hypothesis about the three-factor structure of STS. This is especially important to further verify the validity of the STS.

Psychometric validation is a continuous process that requires evaluations in various populations to provide evidence that the measurement tool has the appropriate psychometric properties, adequate validity, and reliability. The Additional file contains not only the Polish version (Additional file 1: Appendix S1) but also the English version (Additional file 2: Appendix S2) of the STS. The English version was created in cooperation between an English translator and the authors of STS, allowing English speakers to acquaint themselves with the content of the questionnaire. We encourage other specialists to apply the STS and optimize its use for research and clinical practice.


Skarzynski Tinnitus Scale is a brief and robust tool that is very useful for clinical practice. A great advantage of the STS is that it takes into account the impact of tinnitus on both the psychological (emotional, cognitive) and functional domains and the patient’s ability to cope with tinnitus-related distress. The SDC in global score of 18.43 allows it to be used as a measure of treatment-related change. The STS norms convey a clinical meaning to quantitative scores and are easy to apply in clinical practice.



Beck Depression Inventory


exploratory factor analysis


intraclass correlation coefficient


pure-tone average


smallest detectable change


standard error of measurement


Skarzynski Tinnitus Scale


Tinnitus Functional Index


Tinnitus Handicap Inventory


Tinnitus and Hearing Survey


  1. Jastreboff PJ. Phantom auditory perception (tinnitus): mechanisms of generation and perception. Neurosci Res. 1990;8:221–54.

    Article  CAS  Google Scholar 

  2. Møller AR. Epidemiology of tinnitus in adults. Textb Tinnitus. New York: Springer; 2011. p. 29–37. Accessed 3 Nov 2017.

    Chapter  Google Scholar 

  3. Raj-Koziak D. Szumy uszne u dzieci—przegląd piśmiennictwa. Nowa Audiofonologia. 2016;5:9–14.

    Google Scholar 

  4. Skarżyński PH, Kochanek K, Skarżyński H, Senderski A, Wysocki J, Szkiełkowska A, et al. Hearing screening program in school-age children in western Poland. J Int Adv Audiol. 2011;7:194–200.

    Google Scholar 

  5. Skarzyński PH, Świerniak W, Piłka A, Skarżynska MB, Włodarczyk AW, Kholmatov D, et al. A hearing screening program for children in primary schools in Tajikistan: a telemedicine model. Med Sci Monit. 2016;22:2424–30.

    Article  Google Scholar 

  6. Kennedy V, Wilson C, Stephens D. Quality of life and tinnitus. Audiol Med. 2004;2:1–12.

    Article  Google Scholar 

  7. Langguth B, Landgrebe M, Kleinjung T, Sand PG, Hajak G. Tinnitus and depression. World J Biol Psychiatry. 2011;12:489–500.

    Article  Google Scholar 

  8. Newman CW, Sandridge SA. Tinnitus questionnaires. In: Snow JB, editor. Tinnitus theory manag. Hamilton: BD Decker; 2004. p. 237–54.

    Google Scholar 

  9. Zeman F, Koller M, Langguth B, Landgrebe M. Which tinnitus-related aspects are relevant for quality of life and depression: results from a large international multicentre sample. Health Qual Life Outcomes. 2014;12:7.

    Article  Google Scholar 

  10. Henry JA, McMillan GP, Thielman EJ, Galvez G, Zaugg TL, Porsov E, et al. Evaluating psychoacoustic measures for establishing presence of tinnitus. J Rehabil Res Dev. 2013;50:573–84.

    Article  Google Scholar 

  11. Henry JA, Meikle MB. Psychoacoustic measures of tinnitus. J Am Acad Audiol. 2000;11:138–55.

    CAS  PubMed  Google Scholar 

  12. Seydel C, Haupt H, Olze H, Szczepek AJ, Mazurek B. Gender and chronic tinnitus: differences in tinnitus-related distress depend on age and duration of tinnitus. Ear Hear. 2013;34:661–72.

    Article  Google Scholar 

  13. Fackrell K, Hall D, Barry J, Hoare D. Tools for tinnitus measurement: development and validity of questionnaires to assess handicap and treatment effects. Tinnitus causes treat short long-term health eff. New York: Nova Science Publishers Inc; 2014. p. 13–60.

    Google Scholar 

  14. Raj-Koziak D, Gos E, Rajchel JJ, Piłka A, Skarżyński H, Rostkowska J, et al. Tinnitus and Hearing Survey: a polish study of validity and reliability in a clinical population. Audiol Neurotol. 2017;22:197–204.

    Article  Google Scholar 

  15. Skarzynski PH, Raj-Koziak D, Rajchel JJ, Pilka A, Wlodarczyk AW, Skarzynski H. Adaptation of the Tinnitus Handicap Inventory into Polish and its testing on a clinical population of tinnitus sufferers. Int J Audiol. 2017;56:711–5.

    Article  Google Scholar 

  16. Wrzosek M, Szymiec E, Klemens W, Kotyło P, Schlee W, Modrzyńska M, et al. Polish translation and validation of the Tinnitus Handicap Inventory and the Tinnitus Functional Index. Front Psychol. 2016;7. Accessed 22 Feb 2017.

  17. Rajchel J, Skarżyński PH. Przegląd wybranych narzędzi badawczych do oceny występowania oraz charakterystyki szumów usznych i nadwrażliwości słuchowej. Nowa Audiofonologia. 2016;5:74–88.

    Google Scholar 

  18. Newman CW, Jacobson GP, Spitzer JB. Development of the Tinnitus Handicap Inventory. Arch Otolaryngol Head Neck Surg. 1996;122:143–8.

    Article  CAS  Google Scholar 

  19. Newman CW, Sandridge SA, Jacobson GP. Psychometric adequacy of the Tinnitus Handicap Inventory (THI) for evaluating treatment outcome. J Am Acad Audiol. 1998;9:153–60.

    CAS  PubMed  Google Scholar 

  20. Baguley DM, Andersson G. Factor analysis of the Tinnitus Handicap Inventory. Am J Audiol. 2003;12:31–4.

    Article  CAS  Google Scholar 

  21. Fackrell K, Hall DA, Barry JG, Hoare DJ. Psychometric properties of the Tinnitus Functional Index (TFI): assessment in a UK research volunteer population. Hear Res. 2016;335:220–35.

    Article  Google Scholar 

  22. Heszen I. Psychologia stresu. Warszawa: Wydawnictwo Naukowe PWN; 2015.

    Google Scholar 

  23. Budd RJ, Pugh R. Tinnitus coping style and its relationship to tinnitus severity and emotional distress. J Psychosom Res. 1996;41:327–35.

    Article  CAS  Google Scholar 

  24. Andersson G, Kaldo V, Strömgren T, Ström L. Are coping strategies really useful for the tinnitus patient? An investigation conducted via the internet. Audiol Med. 2004;2:54–9.

    Article  Google Scholar 

  25. Henry JL, Wilson PH. Coping with tinnitus: two studies of psychological and audiological characteristics of patients with high and low tinnitus-related distress. Int Tinnitus J. 1995;1:85–92.

    CAS  PubMed  Google Scholar 

  26. Martz E, Henry JA. Coping with tinnitus. J Rehabil Res Dev. 2016;53:729–42.

    Article  Google Scholar 

  27. Hall DA, Szczepek AJ, Kennedy V, Haider H. Current-reported outcome domains in studies of adults with a focus on the treatment of tinnitus: protocol for a systematic review. BMJ Open. 2015;5:e009091.

    Article  Google Scholar 

  28. Hall DA, Haider H, Szczepek AJ, Lau P, Rabau S, Jones-Diette J, et al. Systematic review of outcome domains and instruments used in clinical trials of tinnitus treatments in adults. Trials. 2016;17:270.

    Article  Google Scholar 

  29. de Vet HCW, Terwee CB, Mokkink LB, Knol DL. Measurement in medicine: a practical guide. Cambridge: Cambridge University Press; 2011.

    Book  Google Scholar 

  30. Beck AT, Ward CH, Mendelson M, Mock J, Erbaugh J. An inventory for measuring depression. Arch Gen Psychiatry. 1961;4:561–71.

    Article  CAS  Google Scholar 

  31. Beck AT, Steer RA, Carbin MG. Psychometric properties of the Beck Depression Inventory: twenty-five years of evaluation. Clin Psychol Rev. 1988;8:77–100.

    Article  Google Scholar 

  32. Parnowski T, Jernajczyk W. Inwentarz Depresji Becka w ocenie nastroju osób zdrowych i chorych na choroby afektywne. Psychiatr Pol. 1977;11:417–21.

    CAS  PubMed  Google Scholar 

  33. Henry JA, Griest S, Zaugg TL, Thielman E, Kaelin C, Galvez G, et al. Tinnitus and hearing survey: a screening tool to differentiate bothersome tinnitus from hearing difficulties. Am J Audiol. 2015;24:66–77.

    Article  Google Scholar 

  34. Meikle MB, Henry JA, Griest SE, Stewart BJ, Abrams HB, McArdle R, et al. The tinnitus functional index: development of a new clinical measure for chronic, intrusive tinnitus. Ear Hear. 2012;33:153–76.

    Article  Google Scholar 

  35. Nunnally JC, Bernstein IH. Psychometric theory. New York: Mc Grraw Hill; 1998.

    Google Scholar 

  36. Terwee CB, Bot SDM, de Boer MR, van der Windt DAWM, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60:34–42.

    Article  Google Scholar 

  37. Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet Lond Engl. 1986;1:307–10.

    Article  CAS  Google Scholar 

  38. Lord FM, Novick MR. Statistical theories of mental test scores. USA: Addison-Wesley Publishing; 1968.

    Google Scholar 

  39. McDonald RP. Test theory: a unified treatment. Mahwah: LEA; 1999.

    Google Scholar 

  40. Marx RG, Menezes A, Horovitz L, Jones EC, Warren RF. A comparison of two time intervals for test-retest reliability of health status instruments. J Clin Epidemiol. 2003;56:730–5.

    Article  Google Scholar 

  41. Loewenthal KM. An introduction to psychological tests and scales. 2nd ed. Hove: Psychology Press; 2004.

    Google Scholar 

  42. Budd RJ, Pugh R. The relationship between locus of control, tinnitus severity, and emotional distress in a group of tinnitus sufferers. J Psychosom Res. 1995;39:1015–8.

    Article  CAS  Google Scholar 

  43. Brose A, Schmiedek F, Lövdén M, Lindenberger U. Normal aging dampens the link between intrusive thoughts and negative affect in reaction to daily stressors. Psychol Aging. 2011;26:488–502.

    Article  Google Scholar 

  44. Lynch TR, Schneider KG, Rosenthal MZ, Cheavens JS. A mediational model of trait negative affectivity, dispositional thought suppression, and intrusive thoughts following laboratory stressors. Behav Res Ther. 2007;45:749–61.

    Article  Google Scholar 

Download references

Authors’ contributions

HS, EG and DRK were responsible for study design and preliminary version of the Skarzynski Tinnitus Scale. EG and DRK performed literature review. DRK performed the tinnitus examination and collected patient’s data. EG performed statistical analysis and together with DRK and HS analyzed and interpreted the patient data. EG, DRK and PHS drafted the manuscript. HS performed a critical review of the manuscript and significantly contributed to the manuscript draft and editing. All authors read and approved the final manuscript.


The authors would like to express their gratitude to colleagues from the World Hearing Center: Katarzyna Bienkowska, Beata Dziendziel, Karina Karendys, Lucyna Karpiesz, Justyna Kutyba, Iwona Niedzialek, Karolina Penar, Joanna Rajchel, Izabela Sarnicka and Weronika Swierniak for their help in developing the STS.

Competing interests

The authors declare that they have no competing interests.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Consent for publication

Not applicable.

Ethics approval and consent to participate

The study was approved by the Ethics Committee of the Institute of Physiology and Pathology of Hearing (Approval Number IFPS: KB/18/2017).


This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Piotr H. Skarżyński.

Additional files

Additional file 1: Appendix S1.

Skala Szumów Usznych Skarżyńskiego.

Additional file 2: Appendix S2.

Skarzynski Tinnitus Scale.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Skarżyński, H., Gos, E., Raj-Koziak, D. et al. Skarzynski Tinnitus Scale: validation of a brief and robust tool for assessing tinnitus in a clinical population. Eur J Med Res 23, 54 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: