Back to the mapChapterEvaluation & calibrationRubric design, LLM-as-judge grading, gaming resistance, and real-time signal extraction.Tech leadLive work-skills calibration with merit-based XPCalibration system that maintains a live read of how each user actually performs on five work-skills and turns it into XP that ladders up toward real job-readiness.Read moreMajor contributorBehavioral state detection that drives manager modeReal-time classifier that reads confusion, stress, disengagement, and flow from activity signals and switches the manager NPC between supportive and Socratic mode accordingly.Read moreTech leadManager-learner relationship calibrationPer-learner running read of the working relationship with each manager NPC, so the manager treats the tenth conversation differently from the first.Read moreSole authorStage submission grading with anti-gamingSubmission system that extracts the user's artefact, grades it against the stage rubric, blocks resubmit-gaming, and ships back a verdict with a celebratory tagline and an XP award when the user passes.Read moreSole authorEnd-of-scenario performance reviewReview system that steps back at the end of a whole scenario and names the pattern across every stage, conversation, and submission the user moved through.Read moreSole authorPre-stage understanding gatesTwo quick understanding checks that catch a misread of the assignment before it turns into days of wrong work.Read moreTech leadPer-learner journey narrativeRunning per-learner story that compounds across every touchpoint so the rest of the platform knows how the user is actually moving through their focus, not just where they passed or failed.Read moreMajor contributorEval observability: run log and cohort analyticsTwo observability tools that watch every evaluation the platform runs: a per-call run log and a cohort analytics view that reads the log to surface where the curriculum is breaking down.Read moreMajor contributorNPC walkthrough of the stage submission verdictRealtime feedback flow where the user's manager NPC walks them through the stage-submission verdict with audio, evidence highlights on the original file, and a lazy positives sequence.Read more