Training an LLM to generate queries from natural language prompts - GSoC 2025

Hello, my name is Jigyasu. I am an electronics engineering undergraduate. I was going through the NumFOCUS accepted projects and found AiiDA there. I am interested in working on this project for GSoC '25 and I just wanted to say hi!

I have research experience in LLMs where I have fine-tuned them before, to be specific, I fine-tuned Google’s Gemma on the MedQuAD dataset. I am also collaborating with a team at UofSC to research watermarking methods for LLM-generated text.

I like open-source software because I believe it is the epitome of what human collaboration can do and in the past, I have contributed actively to sktime, a machine learning framework for time series and I am experienced in object-oriented programming paradigms from there.

I just set up the development environment and I will open a PR soon :slight_smile:

I would also like to join the biweekly meetings, so please let me know the schedule of it.

Redirect to Google Summer of Code (GSoC) 2025