[GSoC 2025 PROPOSAL] Yuxiang Jiang - Jenkins-specific LLM for CI Failure Analysis

YuxiangJiangCT · April 10, 2025, 7:53pm

Hi Jenkins community ,

My name is Yuxiang(Ryan) Jiang, and I’m currently pursuing a controller’s in Computer and Information Science at Cornell University. I’m passionate about combining machine learning, developer tools, and systems optimization to solve practical engineering problems.

For GSoC 2025, I’ve submitted a proposal titled “Fine-tuning a Jenkins-Specific LLM for CI Failure Analysis.” This project is based on the official idea “Domain-specific LLM based on Jenkins usage data” and aims to build an intelligent assistant that can help Jenkins users diagnose build and test failures using a fine-tuned LLM or Retrieval-Augmented Generation.

The proposal outlines:

A data pipeline for processing real CI logs from ci.jenkins.io
Fine-tuning or RAG integration using LLaMA 2 or similar open-source models
A web-based assistant (React + Django/FastAPI)
Target use-case: classifying failures as infra issues, flaky tests, or actual bugs

I’ve reviewed the 2024 Jenkins LLM project, set up a Jenkins instance locally, and started participating in the Gitter channel. I’m excited about contributing to Jenkins and exploring how AI can improve CI/CD developer experience.

Proposal Attachment：
You can view my full proposal here on Google Docs:
Fine-tuning a Jenkins-Specific LLM for CI Failure Analysis
The document is publicly viewable, and feedback or comments are very welcome!

Looking forward to learning from the community and collaborating with mentors or contributors who have worked on related projects!

Thanks!
Yuxiang

Topic		Replies	Views
[GSOC 2025 Proposal] for the project “Domain-specific LLM based on actual Jenkins usage using ci.jenkins.io data” GSoC	1	71	March 29, 2025
[GSoC 2025] Kavinkumar Baskar - Domain specific LLM based on actual Jenkin usage using ci.jenkins.io data GSoC	1	55	April 1, 2025
[GSOC 2025 Proposal] for the project "Domain-specific LLM based on actual Jenkins usage using ci.jenkins.io data" GSoC	1	83	March 26, 2025
Jenkins GSoC 2025 introduction post - Domain-specific LLM based on actual Jenkins usage using ci.jenkins.io data GSoC	1	78	March 24, 2025
[GCoC 2025 PROPOSAL] Kavypriya - AI-powered chatbot for Jenkins GSoC	1	77	April 1, 2025

[GSoC 2025 PROPOSAL] Yuxiang Jiang - Jenkins-specific LLM for CI Failure Analysis

Related topics