Proposal: AI Copilot for AiiDA (Multi-Agent + RAG)

Unnati_Kadam · February 26, 2026, 9:36am

Hi everyone,

After reading this year’s GSoC proposal on implementing a natural language interface for AiiDA using multi-agent AI, I’ve been thinking about a possible direction and would love to get early feedback.

Rather than approaching this as a simple chat-based wrapper around CLI commands, I’m considering framing it as an “AI Copilot” for AiiDA, an intelligent assistant that integrates into the workflow lifecycle and assists users in a structured, architecture-aware way.

AiiDA has a rich architecture (engine, workflows, provenance graph, plugins), and a natural language interface should ideally:

Respect provenance integrity
Avoid unsafe or careless code generation
Integrate cleanly with existing abstractions
Be maintainable and modular

Given the team’s concerns about low-effort AI-generated contributions, I think this project should focus heavily on architectural design and validation mechanisms rather than just LLM integration.

High-Level Concept

The AI Copilot would assist users in:

Workflow Design
- Generate WorkChain skeletons
- Suggest input structures
- Guide plugin usage
Debugging
- Inspect failed processes
- Parse logs
- Explain likely causes
- Suggest corrective actions
Provenance Exploration
- Translate natural language into structured QueryBuilder queries
- Summarize provenance graphs
- Explain data lineage
Optimization & Suggestions
- Detect common misconfigurations
- Suggest improvements based on past runs

Proposed Architecture

Instead of a single LLM, I’m considering a modular agent structure:

Intent Agent

Classifies user requests (create workflow, debug, inspect provenance, etc.).

Context / Retrieval Agent (RAG)

Retrieves relevant information from:

AiiDA documentation
Local workflow code
Execution logs
Node metadata / provenance graph

This would ground responses and reduce hallucinations.

Execution / Generation Agent

Generates:

WorkChain templates
QueryBuilder queries
Explanations
Debugging suggestions

All outputs would be structured and constrained to AiiDA’s abstractions.

Validation / Safety Layer

Checks:

Compatibility with AiiDA’s architecture
Proper engine usage (run, submit, etc.)
Schema correctness

This layer would help address concerns around uncontrolled LLM-generated code.

Preliminary Technical Goals

If this direction aligns with the team’s expectations, I would aim to:

Draft a more detailed architectural design (components, data flow, extension points)
Define strict output schemas for generated code
Prototype a minimal but well-structured implementation

Open Questions

I’d really appreciate feedback on this. Looking forward to your thoughts — and excited about the possibility of building with AIIDA.

Thanks!
Unnati Kadam

geiger_j · March 27, 2026, 4:21pm

Hi Unnati,

Thanks for your interest and for sharing your thinking!

As I just mentioned in the official GSoC thread, we are not providing individual feedback on proposal directions before the portal closes. Please submit your application via the GSoC portal before March 31st at 18:00 UTC, covering all points in our application requirements.

We also ask that GSoC-related discussions be kept in the main GSoC thread rather than separate topics.

The AiiDA team

Topic		Replies	Views
New Era of AiiDA Developer question , aiida , gsoc	7	142	March 27, 2026
Google Summer of Code (GSoC) 2026 Announcements gsoc	21	627	May 2, 2026
Looking to get involved early in the GSoC 2026 AI project New to AiiDA question , discussion	1	181	February 24, 2026
Google Summer of Code (GSoC) 2025 Announcements discussion	10	619	March 28, 2025
Question about GSoC 2026 General Usage question	4	235	February 9, 2026

Proposal: AI Copilot for AiiDA (Multi-Agent + RAG)

High-Level Concept

Proposed Architecture

Intent Agent

Context / Retrieval Agent (RAG)

Execution / Generation Agent

Validation / Safety Layer

Preliminary Technical Goals

Open Questions

Related topics