Background: Rapid assessment of stroke severity is critical in emergency settings to guide timely management. The National Institutes of Health Stroke Scale (NIHSS) is widely used, but its diagnostic limitations—particularly in detecting non-dominant hemisphere syndromes and aphasia—warrant evaluation against alternative tools such as the Scandinavian Stroke Scale (SSS). Objectives: To compare the diagnostic accuracy of the NIHSS and SSS in assessing early stroke severity among adult patients presenting to the emergency medicine department, using CT brain imaging as the reference standard. Methods: This cross-sectional analytical study was conducted at the Emergency Medicine Department of Bharati Hospital, Pune, over a one-year period (January 2024–January 2025). Adults (≥18 years) with suspected ischemic stroke were assessed using both NIHSS and SSS at presentation. Stroke severity was categorized into mild, moderate, and severe for each scale. CT imaging served as the reference standard. Sensitivity, specificity, and positive predictive value (PPV) were calculated for each scale. Results: A total of 306 patients were included. The NIHSS classified 39.5% as mild, 40.5% moderate, and 19.9% severe, while SSS categorized 38.2% as mild, 40.8% moderate, and 20.9% severe. Cohen’s Kappa between the two scales was 0.567, indicating moderate agreement. In patients with classifiable CT findings (n = 13), SSS showed higher sensitivity (62.5%) and PPV (83.3%) compared to NIHSS (sensitivity 37.5%, PPV 75.0%). Both tools had equal specificity (80.0%). Conclusions: While NIHSS and SSS are both effective in early stroke severity assessment, the SSS demonstrated greater sensitivity and predictive value when compared to CT imaging. These findings suggest that SSS may offer a more sensitive alternative for early stroke triage in emergency settings
Stroke is one of the leading causes of adult disability and mortality worldwide, with ischemic stroke accounting for the majority of cases. Early and accurate assessment of stroke severity plays a pivotal role in determining prognosis, guiding treatment decisions, and streamlining triage in emergency settings [1].
The National Institutes of Health Stroke Scale (NIHSS) is one of the most widely used tools for stroke assessment due to its standardized format and validation in both clinical and research settings [2]. Initially designed for use by neurologists, the NIHSS has since been extended to non-neurologist healthcare providers, demonstrating acceptable reliability and ease of application in acute care environments [2,3]. Despite its widespread use, the NIHSS has limitations—particularly in detecting less overt symptoms such as aphasia, cognitive deficits, or right-hemisphere syndromes [4].
Alternative tools, such as the Scandinavian Stroke Scale (SSS), offer simplified scoring structures and a more balanced assessment of motor, consciousness, and speech functions. Although less commonly used than the NIHSS, the SSS has shown promise in outcome prediction and may be more practical in certain emergency or non-specialist settings. It also allocates more explicit scoring for speech and eye movement, potentially increasing sensitivity to deficits like aphasia [8].
Aphasia, a common and often underrecognized consequence of ischemic stroke, has been linked to poor short- and long-term outcomes [5]. While comprehensive batteries such as the Western Aphasia Battery (WAB-R) [6] and the Comprehensive Aphasia Test (CAT) [7] exist for in-depth language assessment, their use is impractical in emergency settings. Therefore, the choice of an initial severity scale that adequately captures language impairment is crucial. Screening tools like the Language Screening Test (LAST) have been developed for this purpose [8], but are often used alongside broader severity scales, not in isolation.
Given the evolving epidemiology of stroke and the increasing emphasis on time-sensitive interventions, there is a need to compare practical, bedside scales that can be administered at the point of care by frontline physicians [8]. While the NIHSS continues to serve as the gold standard in many institutions, real-world data on its comparative accuracy with alternatives like the SSS in emergency department (ED) triage settings remains limited.
Aims and Objectives
Aim
To evaluate and compare the diagnostic accuracy of the National Institutes of Health Stroke Scale (NIHSS) and the Scandinavian Stroke Scale (SSS) in assessing early stroke severity at the point of entry into the emergency medicine department.
Objectives
This cross-sectional analytical study was conducted at the Emergency Medicine Department of Bharati Hospital and Research Centre, Pune, over a period of one year, from January 2024 to January 2025. The study aimed to evaluate and compare the diagnostic accuracy of the National Institutes of Health Stroke Scale (NIHSS) and the Scandinavian Stroke Scale (SSS) in assessing stroke severity at the time of patient presentation.
All adult patients aged 18 years and above presenting to the Emergency Medicine Department with symptoms suggestive of acute ischemic stroke were considered for inclusion. Patients were excluded if they had already been diagnosed or treated for stroke at another facility, were intubated prior to arrival, or were sedated or paralyzed, thereby limiting accurate neurological assessment.
Eligible patients were evaluated immediately upon arrival using the departmental stroke assessment protocol, which included initial vital signs (blood pressure, pulse, respiratory rate, SpO₂, body temperature), neurological status (Glasgow Coma Scale), blood sugar measurement, and electrocardiography. Stroke severity was assessed using both the NIHSS and SSS scoring systems. Each patient was evaluated by two trained and certified emergency medicine residents independently, both of whom were specifically trained in stroke scale application to minimize inter-observer variability.
Non-contrast computed tomography (CT) brain imaging was performed for all patients as per standard emergency stroke protocol. CT reports were reviewed and categorized into mild, moderate, or severe stroke based on radiological findings including infarct size, location, presence of haemorrhagic transformation, mass effect, or midline shift.
All clinical and radiological data were entered prospectively into a structured Excel spreadsheet and later analyzed using Python and statistical tools for descriptive analysis, confusion matrices, and diagnostic performance metrics. Stroke severity scores were categorized into mild, moderate, or severe based on established thresholds for both NIHSS and SSS. CT imaging findings were used as the reference standard to calculate sensitivity, specificity, and positive predictive value (PPV) for each scale.
The study was approved by the Institutional Ethics Committee prior to initiation. Informed consent was obtained from all patients or their legally authorized representatives prior to participation.
Baseline Characteristics of the Study Population
A total of 306 patients were included in the study. The mean age was 59.7 years (SD ± 14.9), with a range from 29 to 91 years. The mean Glasgow Coma Scale (GCS) on presentation was 13.6 (SD ± 3.1).
Vital parameters at the time of emergency department arrival included:
Of the 306 patients initially included, 221 patients (145 males and 76 females) had valid demographic data for analysis. Among them:
These findings reflect a population with high cardiovascular risk factors, aligning with known ischemic stroke profiles.
Table1a: Numerical Baseline Characteristics
Parameter |
Mean |
SD |
Min |
Max |
Age |
59.91228 |
14.92036 |
29 |
91 |
GCS |
13.51818 |
2.511096 |
3 |
15 |
Systolic BP |
161.6393 |
32.60386 |
90 |
260 |
Diastolic BP |
93.96347 |
20.79384 |
40 |
170 |
Pulse |
85.38813 |
12.95405 |
49 |
125 |
Random Sugar |
160.7419 |
59.84647 |
92 |
450 |
Table1b: Categorical Baseline Characteristics
Parameter |
Count |
Percentage (%) |
Sex (Male) |
145 |
65.6 |
Sex (Female) |
76 |
34.4 |
Hypertension |
131 |
59.3 |
Diabetes Mellitus |
56 |
25.3 |
Smoking |
39 |
17.6 |
Alcohol |
57 |
25.8 |
Tobacco |
46 |
20.8 |
Dyslipidaemia |
7 |
3.2 |
All 306 patients underwent stroke severity assessment at the time of presentation using both the National Institutes of Health Stroke Scale (NIHSS) and the Scandinavian Stroke Scale (SSS).
The mean NIHSS score was 8.6 ± 7.0, with a minimum of 0 and a maximum of 42. Based on NIHSS classification:
The mean SSS score was 40.1 ± 15.4, with values ranging from 0 to 58. Based on SSS severity cut-offs:
The distribution of stroke severity categories across the two scoring systems was broadly similar, though small differences were noted in the classification of borderline cases.
Figure1: Stroke Severity Category Distribution: NIHSS vs SSS
Figure 1. Distribution of stroke severity categories classified by the NIHSS and Scandinavian Stroke Scale (SSS). The bar chart compares the number of patients categorized as mild, moderate, or severe by each scoring system.
To assess the level of agreement between the NIHSS and Scandinavian Stroke Scale (SSS) in classifying stroke severity, both scores were categorized into mild, moderate, and severe categories using validated thresholds. A cross-tabulation was performed to evaluate classification concordance between the two scales.
While a high degree of overall agreement was observed, certain discrepancies emerged in the classification of borderline moderate-to-severe cases. The detailed distribution is presented in the confusion matrix below.
Table2: NIHSS vs SSS Confusion Matrix
NIHSS Severity |
Mild |
Moderate |
Severe |
Mild |
89 |
30 |
2 |
Moderate |
26 |
84 |
14 |
Severe |
2 |
11 |
48 |
The level of agreement between NIHSS and SSS severity classifications was assessed using Cohen’s Kappa coefficient. The analysis yielded a Kappa value of 0.567, which indicates a moderate level of agreement beyond chance.
This interpretation aligns with the Landis and Koch classification, where a Kappa value between 0.41 and 0.60 is considered moderate agreement.
Among the 306 patients assessed, classification agreement between the NIHSS and Scandinavian Stroke Scale (SSS) was evaluated using a three-level severity categorization (mild, moderate, severe). The highest concordance was observed in the mild and severe categories, with 89 patients consistently labelled as mild and 48 as severe by both scales. However, discrepancies were noted particularly in the moderate group, where classification varied between the two scoring systems.
The level of agreement between the two scales was further quantified using Cohen’s Kappa, which yielded a value of 0.567, indicating moderate agreement beyond chance.
To evaluate the diagnostic performance of the NIHSS and Scandinavian Stroke Scale (SSS) in assessing early stroke severity, both scoring systems were compared against radiological findings from non-contrast CT brain imaging, considered the reference standard in acute stroke evaluation.
CT brain impressions were reviewed and categorized into three levels of severity—mild, moderate, and severe—based on radiologist-reported findings such as infarct size, presence of haemorrhage, midline shift, or mass effect.
For diagnostic accuracy analysis, severity levels were further dichotomized into non-severe (mild) and clinically significant (moderate to severe) categories. Sensitivity, specificity, and positive predictive value (PPV) were calculated for each scale using CT imaging as the gold standard.
Table3a: NIHSS vs CT Imaging Confusion Matrix
NIHSS Prediction |
0 |
1 |
0 |
4 |
5 |
1 |
1 |
3 |
Table3a: NIHSS vs CT Imaging shows how many patients were classified as having clinically significant stroke (1) or not (0) by NIHSS compared to CT results.
Table3b: SSS vs CT Imaging Confusion Matrix
SSS Prediction |
0 |
1 |
0 |
4 |
3 |
1 |
1 |
5 |
Table3b: SSS vs CT Imaging shows how many patients were classified as having clinically significant stroke (1) or not (0) by SSS compared to CT results.
Table4: Diagnostic Accuracy of NIHSS and SSS Compared to CT Brain Imaging
Metric |
NIHSS |
SSS |
Sensitivity (Recall+) |
0.375 |
0.625 |
Specificity (Recall-) |
0.8 |
0.8 |
PPV (Precision+) |
0.75 |
0.833333 |
Note: CT imaging was used as the reference standard to define stroke severity. “Positive” indicates moderate-to-severe stroke on CT; “Negative” indicates mild stroke. Sensitivity refers to the scale’s ability to correctly identify patients with clinically significant stroke, while specificity refers to its ability to correctly identify those without. PPV indicates the probability that patients classified as positive truly had moderate/severe findings on imaging.
To assess the diagnostic accuracy of NIHSS and SSS, CT imaging findings were used as the reference standard. Among patients with interpretable CT results, NIHSS demonstrated a sensitivity of 37.5% and a specificity of 80.0%, with a positive predictive value (PPV) of 75.0%.
In comparison, the Scandinavian Stroke Scale (SSS) showed superior sensitivity at 62.5%, the same specificity (80.0%), and a higher PPV (83.3%).
These findings suggest that while both tools are specific, the SSS may be more sensitive in detecting moderate-to-severe stroke severity as identified by early CT brain imaging.
To facilitate direct comparison between the two scoring systems, a summary table was compiled to present the key diagnostic performance metrics of the NIHSS and Scandinavian Stroke Scale (SSS), using CT imaging as the reference standard.
This summary highlights that while both tools demonstrate similar specificity (80.0%), the SSS outperformed NIHSS in sensitivity (62.5% vs. 37.5%) and in positive predictive value (83.3% vs. 75.0%).
Table5: Summary of Diagnostic Performance
Metric |
NIHSS |
SSS |
Sensitivity (Recall+) |
37.50% |
62.50% |
Specificity (Recall-) |
80.00% |
80.00% |
PPV (Precision+) |
75.00% |
83.30% |
Table 5 shows Summary of sensitivity, specificity, and positive predictive value (PPV) for NIHSS and Scandinavian Stroke Scale (SSS), using CT imaging as the reference standard.
In summary, both the NIHSS and Scandinavian Stroke Scale demonstrated comparable ability to classify stroke severity in patients presenting to the emergency department. While agreement between the two tools was moderate, the SSS showed higher sensitivity and positive predictive value when benchmarked against CT imaging findings. These results suggest that the SSS may offer a more sensitive point-of-care assessment tool for early detection of clinically significant ischemic stroke in acute emergency settings.
This study evaluated and compared the diagnostic accuracy of the National Institutes of Health Stroke Scale (NIHSS) and the Scandinavian Stroke Scale (SSS) in assessing early stroke severity in patients presenting to the Emergency Medicine Department. Both scales are widely used tools designed for rapid clinical evaluation; however, their comparative effectiveness in acute emergency settings continues to be a subject of investigation [10].
Patient Profile and Stroke Severity Trends
The baseline characteristics of the study population reflect a typical ischemic stroke demographic, with a mean age of 59.7 years and high prevalence of vascular risk factors including hypertension (59.3%) and diabetes mellitus (25.3%). These findings align with established stroke epidemiology and reinforce the importance of early risk stratification in emergency settings [9].
The distribution of stroke severity across both scales was relatively balanced, with NIHSS classifying 39.5% of patients as mild, 40.5% moderate, and 19.9% severe. The SSS showed a similar pattern (38.2% mild, 40.8% moderate, 20.9% severe), underscoring both tools’ utility in identifying varying levels of stroke severity at presentation.
Agreement Between NIHSS and SSS
The agreement between the two scales was moderate, with a Cohen’s Kappa of 0.567, indicating consistency in severity classification but also highlighting discrepancies, particularly in moderate stroke cases. The highest agreement was observed in the mild and severe categories, which is clinically significant given that these groups often guide treatment urgency and imaging prioritization.
These findings are in line with prior research showing that both tools have acceptable inter-rater reliability, but may diverge in cases where neurological deficits are borderline or more complex to interpret [10,12]. Askim et al. [12] previously reported comparable predictive value for 3-month outcomes between the two scales, reinforcing the clinical relevance of both tools despite moderate classification differences.
Diagnostic Accuracy Compared to CT Imaging
Using CT brain imaging as the reference standard, the SSS demonstrated superior sensitivity (62.5%) compared to the NIHSS (37.5%), while both showed equal specificity (80.0%). Additionally, the positive predictive value (PPV) was higher for SSS (83.3%) than NIHSS (75.0%). These findings suggest that the SSS may be more reliable in identifying patients with moderate-to-severe radiological involvement, potentially making it a more sensitive bedside tool in emergency stroke triage.
Similar observations were made in comparative studies evaluating prognostic accuracy, where the CSS (an earlier version of SSS) demonstrated strong predictive capacity for infarct extent and outcomes [13]. Furthermore, Pedersen et al. emphasized that stroke severity correlates closely with lesion size and type on imaging, supporting the validity of using CT as a reference for scale evaluation [9].
NIHSS Limitations in Detecting Aphasia and Specific Deficits
One explanation for the lower sensitivity of NIHSS in this study may be its limited sensitivity to certain deficits, such as aphasia, especially in early or subtle presentations. Studies have noted that NIHSS may underdiagnose language impairments or misclassify patients with non-dominant hemisphere strokes [11,14,15]. Grönberg et al. [11] specifically highlighted NIHSS’s limitations in detecting aphasia with high precision, a finding echoed in systematic reviews [15,16].
Given the clinical and psychosocial impact of aphasia on recovery and quality of life [17], early recognition is vital. This limitation may partially explain why SSS, which distributes scoring more evenly across motor, speech, and consciousness parameters, performed better in capturing moderate-severe radiologic involvement.
Clinical Implications
In busy emergency departments, where time-to-treatment is critical, the availability of a sensitive and simple severity scale can significantly enhance early triage decisions. While NIHSS remains widely used, this study supports the SSS as a practical alternative or complement, especially when neurological findings are subtle or CT access is delayed.
Given its simplicity and rapid administration, SSS may be particularly useful in non-specialist or resource-limited settings, without compromising diagnostic value. These insights can inform both clinical practice and training protocols in emergency stroke care.
Strengths and Limitations
A key strength of this study is the direct comparison of two validated stroke scales against radiological findings in real-world emergency presentations. The structured data collection by trained emergency medicine residents adds to the reliability of the assessments.
However, the diagnostic accuracy analysis was limited to a smaller subset (n = 13) with clearly interpretable CT severity classifications, due to variability in the CT report language. Additionally, long-term functional outcomes (e.g., mRS at 3 months) were not assessed and should be included in future studies for outcome correlation.
This cross-sectional analytical study compared the diagnostic accuracy of the National Institutes of Health Stroke Scale (NIHSS) and the Scandinavian Stroke Scale (SSS) in assessing early stroke severity at the point of entry into the emergency medicine department.
Both tools demonstrated comparable classification patterns; however, the SSS exhibited higher sensitivity and positive predictive value when benchmarked against CT imaging, the radiological gold standard. While NIHSS remains a well-established and widely used scale, these findings suggest that the SSS may serve as a more sensitive tool for identifying clinically significant ischemic strokes—particularly in time-sensitive or resource-constrained environments.
Given its ease of use, balanced scoring across neurologic domains, and stronger correlation with imaging-defined stroke severity, the Scandinavian Stroke Scale may be considered a practical and reliable alternative to NIHSS for early stroke assessment in emergency settings.