Insights/Research & Build-in-Public

Designing GRADEguru: Lessons from Early Institutional Pilots

Building software for institutions is fundamentally different from building for individuals. These are the lessons from our early pilots.

Research & Build-in-PublicInnovgeist · January 2025 · 8 min read

Building for Institutions

Most software is designed for individuals or small teams where a single user makes adoption decisions quickly. Institutional software is different. The decision-making is distributed, the stakeholder interests diverge, the procurement cycles are long, and the expectations for reliability and longevity are higher.

When we began piloting GRADEguru with institutional partners, we encountered all of this — and several things we hadn't anticipated.

In institutional environments, the user is rarely the buyer, and the buyer rarely understands the user's problem.

What the Pilots Taught Us

The most important design decisions aren't technical

Early in our pilot process, we spent significant time optimizing the AI assessment engine — the core of what GRADEguru does. What moved the needle in pilot outcomes was something else entirely: how instructors were onboarded, how the platform integrated with existing assessment workflows, and how clearly we communicated what the AI was doing and what it wasn't.

Instructors didn't distrust the AI because it was bad. They distrusted it because they didn't understand it. Transparency and explainability turned out to be foundational product requirements, not nice-to-haves.

Context variation is larger than expected

Two institutions running the same curriculum will have dramatically different pedagogical approaches, assessment philosophies, and instructor behaviors. A platform that works well in one context can feel entirely wrong in another — not because the core function changed, but because the surrounding context is different.

This pushed us toward a more configurable architecture — not endless settings, but considered flexibility at the points that matter most for different institutional contexts.

Students notice things administrators don't

  • Feedback timing matters more than feedback length — late, thorough feedback is less valuable than timely, focused feedback
  • The framing of AI-generated feedback affects how it's received and acted upon
  • Students will engage more deeply with a platform that demonstrates it understands their individual progress
  • Trust is built through consistency over time, not through feature richness

Platform vs. System

One of the central realisations from our pilot work: an Academic Operating System isn't a product you install. It's a system you integrate — with curriculum design, instructor workflows, assessment cycles, grading policies, and institutional data architecture.

This changes the nature of the deployment. The question isn't only 'does the software work?' It's 'does this system fit the institution's operational reality?' — and 'how does it change that reality for the better?'

The system boundary for an Academic Operating System extends far beyond the software itself.

What This Changes About How We Build

  • Research partnerships precede product decisions — we validate with real instructors and students before building
  • Pilot data informs architecture, not just feature prioritization
  • Transparency and explainability are core requirements, not UX improvements
  • We design for the instructor workflow, not just the student experience
  • Integration with existing institutional systems is treated as a first-class problem

Perspective

GRADEguru continues to evolve through active research partnerships with institutions committed to improving learning outcomes. Each pilot cycle produces findings that directly shape the platform's direction.

Closing Thought

The most important lesson from two years of institutional pilot work: technology that genuinely serves learners must be built from an understanding of how learning actually happens — in the specific, messy, context-dependent way it happens in real institutions.

Everything else is infrastructure. Important infrastructure. But infrastructure in service of something more fundamental.