< Data Detective >
What This Is
You have a real dataset from the CDC. It tracks fine particulate matter (PM2.5) — the tiny particles that get into your lungs and bloodstream — across five localities near Charlottesville from 2003 to 2011.
The localities: Albemarle County, Charlottesville city, Fluvanna County, Greene County, and Nelson County.
Nine years of monthly air quality readings. About 500 rows. Messy government formatting.
Your job: use Claude to go from “I have a CSV” to “I have something I could show a policymaker” — in three steps.
Download the Dataset
This is real data pulled directly from the CDC WONDER database — the same system you’ll use throughout your career to look up mortality rates, environmental exposures, birth outcomes, and more. The file has some quirks (a text footer, “Total” rows mixed in, government-style formatting). That’s on purpose. Real data is messy. Claude handles it.
Step 1: Understand
Upload the CSV to Claude and send this:
Read what Claude tells you. Notice how it spots the footer rows, the “Total” rows, the encoding quirks. This is what it means to have an AI collaborator handle the data wrangling so you can focus on the questions that matter.
Then ask Claude whatever you’re curious about. Some ideas — but follow your own instinct:
Spend a few minutes exploring. Let Claude make charts if it offers to. Ask follow-up questions. Find a thread that interests you — a pattern, a comparison, a surprise in the data.
Step 2: Prototype
Once you’ve found something interesting, ask Claude to help you visualize it:
Claude will generate a visualization and describe the concept. If you don’t love it, push back:
- “Make it simpler — a clinic receptionist needs to understand this in 5 seconds.”
- “Focus on the seasonal pattern instead.”
- “What if we compared just Charlottesville vs. Albemarle?”
This is design thinking. You’re iterating on a prototype using natural language.
Step 3: Communicate
Now turn your finding into something actionable:
Read the output. If it feels too generic, tell Claude:
You’re Done
Post your output to #hds-general on Slack. Share whichever piece you’re most proud of — the chart, the brief, or even just a screenshot of an interesting exchange with Claude. Add one sentence about what surprised you.
Don’t worry about getting it “right.” The point is speed, curiosity, and seeing what’s possible when domain knowledge meets AI. We’ll discuss everyone’s findings on Day 1.
Want More?
Go to CDC WONDER and pull a dataset on something you care about — opioid mortality, infant birth weight, lead exposure, whatever connects to your future specialty. Run through the same three steps. Bring it Monday.