Generate RL dataset for clinical coding
Given a simulated clinical note (discharge summary, operative report), the agent must assign the correct combination of ICD-10 codes from the ~70,000 code taxonomy. Reward is based on coding accuracy, specificity, and compliance with CMS bundling/unbundling rules.
Sample row
{ "clinical_note": "72 y/o male, CABG x3, discharged POD 5...", "expected_codes": ["Z95.1", "I25.10", "Z87.74"], "reward_criteria": { "primary_accuracy": 0.85, "specificity_bonus": 0.10, "bundling_compliance": true } }