In this lab you will apply the survival analysis concepts from the Week 7 lecture to a multivariate clinical dataset (Lab4-Multivariate-Clinical.csv, ~290 patients). You will work through the full analytical pipeline: importing and understanding the data, exploring covariates with PivotTables, building life tables, plotting survival curves, comparing treatment groups, and interpreting the results.
Estimated time: 120 minutes
Tools: Microsoft Excel (2016 or newer)
File: Lab4-Multivariate-Clinical.csv
| Column | Type | Description |
|---|---|---|
Patient_ID |
Text | Unique patient identifier |
Age |
Number | Age at enrollment (years) |
Gender |
Text | Female / Male |
Treatment_Arm |
Text | Standard / Intensive |
Severity_Score |
Number | Disease severity at baseline (0–100; higher = more severe) |
Baseline_CRP |
Number | C-Reactive Protein at baseline (mg/L; inflammatory biomarker) |
Region_Clinic |
Text | North / Central / South |
Time_Months |
Number | Follow-up time in months |
Event_Status |
Number | 1 = event occurred; 0 = censored (no event by end of follow-up) |
Clinical context: Patients with a chronic condition were randomized to Standard or Intensive treatment. Time_Months records how long each patient was followed. Event_Status = 1 means the outcome of interest was observed; 0 means the patient was still event-free when follow-up ended (right-censored).
Lab4-Multivariate-Clinical.csv. Preview the data to confirm columns.Tbl_Clinical (Table Design > Table Name).On a new sheet named Summary, answer:
=COUNTA(Tbl_Clinical[Patient_ID])=COUNTIF(Tbl_Clinical[Event_Status], 1)=COUNTIF(Tbl_Clinical[Event_Status], 0)= Events / Total * 100Goal: Before building survival curves, understand who is in the study. Covariates can influence outcomes and are essential context.
Treatment_Arm. Columns: Gender. Values: Count of Patient_ID.Treatment_Arm.Age, Average of Severity_Score, Average of Baseline_CRP.Region_Clinic, Values = Average of Baseline_CRP.Write 3–4 sentences summarizing what you learned from Tasks 2.1–2.3. Are the groups comparable? Any imbalances worth noting?
Goal: Build a grouped life table by hand in Excel and calculate cumulative survival.
| Interval | Start | End | N_at_Risk | Events | Censored | Interval_Survival | Cumulative_Survival |
|---|---|---|---|---|---|---|---|
For each interval, use COUNTIFS to count:
Events in interval [Start, End]:
=COUNTIFS(Tbl_Clinical[Time_Months], ">"&Start, Tbl_Clinical[Time_Months], "<="&End, Tbl_Clinical[Event_Status], 1)
Censored in interval [Start, End]:
=COUNTIFS(Tbl_Clinical[Time_Months], ">"&Start, Tbl_Clinical[Time_Months], "<="&End, Tbl_Clinical[Event_Status], 0)
For the first interval (0–12), use ">="&0 or ">=0" for the lower bound.
N_at_Risk = total number of patients.= Previous N_at_Risk − Previous Events − Previous Censored.1 − (Events / N_at_Risk)= Previous Cumulative_Survival × Current Interval_SurvivalGoal: Compare survival between Standard and Intensive treatment arms.
On the Survival sheet (or a new sheet Stratified), create two life tables side by side:
Use the same intervals (0–12, 12–24, 24–36, 36–48, 48–60).
For the COUNTIFS formulas, add one more criterion:
Tbl_Clinical[Treatment_Arm], "Standard" (or "Intensive").
Calculate N_at_Risk, Events, Censored, Interval_Survival, and Cumulative_Survival for each arm separately.
Write 2–3 sentences answering: 1. Which treatment arm appears to have better survival? 2. At what time point does the difference become most visible? 3. Can we conclude one treatment is statistically better from this visual alone? Why or why not?
Goal: Visualize the relationship between disease severity and follow-up time, distinguishing events from censored observations.
=IF([@Event_Status]=1, "Event", "Censored").Severity_Score. Y-axis: Time_Months.Write 2–3 sentences: 1. Is there a visible relationship between severity and time? 2. Where do the censored observations tend to cluster? What does this tell you?
Write a short memo (half a page to one page) covering:
Submit one workbook StudentName_ID_Lab4.xlsx containing:
| Sheet | Content |
|---|---|
Tbl_Clinical (or Data) |
Imported dataset as Excel Table |
Summary |
Quick counts (total, events, censored, event rate) |
Exploration |
PivotTables (Tasks 2.1–2.3) + bar chart + written interpretation |
Survival |
Overall life table + overall survival curve |
Stratified |
Two stratified life tables + comparison chart + interpretation |
Scatter |
Severity vs. Time scatter + interpretation |
Plus the short memo (in a text box on a sheet, or as a separate Word/PDF upload).
Submit via the LMS.
Use your own Excel file; course datasets are for learning only.