pr-agent/docs/docs/core-abilities/self_reflection.md

## TL;DR

Qodo Merge implements a **self-reflection** process where the AI model reflects, scores, and re-ranks its own suggestions, eliminating irrelevant or incorrect ones.
This approach improves the quality and relevance of suggestions, saving users time and enhancing their experience.
Configuration options allow users to set a score threshold for further filtering out suggestions.

## Introduction - Efficient Review with Hierarchical Presentation


Given that not all generated code suggestions will be relevant, it is crucial to enable users to review them in a fast and efficient way, allowing quick identification and filtering of non-applicable ones.

To achieve this goal, Qodo Merge offers a dedicated hierarchical structure when presenting suggestions to users:

- A "category" section groups suggestions by their category, allowing users to quickly dismiss irrelevant suggestions.
- Each suggestion is first described by a one-line summary, which can be expanded to a full description by clicking on a collapsible.
- Upon expanding a suggestion, the user receives a more comprehensive description, and a code snippet demonstrating the recommendation.

!!! note "Fast Review"
    This hierarchical structure is designed to facilitate rapid review of each suggestion, with users spending an average of ~5-10 seconds per item.

## Self-reflection and Re-ranking

The AI model is initially tasked with generating suggestions, and outputting them in order of importance.
However, in practice we observe that models often struggle to simultaneously generate high-quality code suggestions and rank them well in a single pass.
Furthermore, the initial set of generated suggestions sometimes contains easily identifiable errors.

To address these issues, we implemented a "self-reflection" process that refines suggestion ranking and eliminates irrelevant or incorrect proposals.
This process consists of the following steps:

1. Presenting the generated suggestions to the model in a follow-up call.
2. Instructing the model to score each suggestion on a scale of 0-10 and provide a rationale for the assigned score.
3. Utilizing these scores to re-rank the suggestions and filter out incorrect ones (with a score of 0).
4. Optionally, filtering out all suggestions below a user-defined score threshold.

Note that presenting all generated suggestions simultaneously provides the model with a comprehensive context, enabling it to make more informed decisions compared to evaluating each suggestion individually.

To conclude, the self-reflection process enables Qodo Merge to prioritize suggestions based on their importance, eliminate inaccurate or irrelevant proposals, and optionally exclude suggestions that fall below a specified threshold of significance.
This results in a more refined and valuable set of suggestions for the user, saving time and improving the overall experience.

## Example Results

![self_reflection](https://codium.ai/images/pr_agent/self_reflection1.png){width=768}
![self_reflection](https://codium.ai/images/pr_agent/self_reflection2.png){width=768}


## Appendix - Relevant Configuration Options
```
[pr_code_suggestions]
suggestions_score_threshold	= 0 # Filter out suggestions with a score below this threshold (0-10)
```
TLDR 2024-09-16 09:21:52 +03:00			`## TL;DR`

Format files by `pre-commit run -a` Signed-off-by: Yu Ishikawa <yu-iskw@users.noreply.github.com> 2024-10-30 09:56:03 +09:00			`Qodo Merge implements a self-reflection process where the AI model reflects, scores, and re-ranks its own suggestions, eliminating irrelevant or incorrect ones.`
			`This approach improves the quality and relevance of suggestions, saving users time and enhancing their experience.`
TLDR 2024-09-16 09:21:52 +03:00			`Configuration options allow users to set a score threshold for further filtering out suggestions.`

			`## Introduction - Efficient Review with Hierarchical Presentation`
self_reflection 2024-09-15 14:47:27 +03:00

			`Given that not all generated code suggestions will be relevant, it is crucial to enable users to review them in a fast and efficient way, allowing quick identification and filtering of non-applicable ones.`

Qodo Merge rename 2024-09-29 17:15:49 +03:00			`To achieve this goal, Qodo Merge offers a dedicated hierarchical structure when presenting suggestions to users:`
self_reflection 2024-09-15 14:47:27 +03:00
self_reflection 2024-09-15 14:50:24 +03:00			`- A "category" section groups suggestions by their category, allowing users to quickly dismiss irrelevant suggestions.`
self_reflection 2024-09-15 14:47:27 +03:00			`- Each suggestion is first described by a one-line summary, which can be expanded to a full description by clicking on a collapsible.`
			`- Upon expanding a suggestion, the user receives a more comprehensive description, and a code snippet demonstrating the recommendation.`

TLDR 2024-09-16 09:21:52 +03:00			`!!! note "Fast Review"`
			`This hierarchical structure is designed to facilitate rapid review of each suggestion, with users spending an average of ~5-10 seconds per item.`
self_reflection 2024-09-15 14:47:27 +03:00
TLDR 2024-09-16 09:21:52 +03:00			`## Self-reflection and Re-ranking`
self_reflection 2024-09-15 14:47:27 +03:00
			`The AI model is initially tasked with generating suggestions, and outputting them in order of importance.`
			`However, in practice we observe that models often struggle to simultaneously generate high-quality code suggestions and rank them well in a single pass.`
self_reflection 2024-09-15 14:50:24 +03:00			`Furthermore, the initial set of generated suggestions sometimes contains easily identifiable errors.`
self_reflection 2024-09-15 14:47:27 +03:00
Format files by `pre-commit run -a` Signed-off-by: Yu Ishikawa <yu-iskw@users.noreply.github.com> 2024-10-30 09:56:03 +09:00			`To address these issues, we implemented a "self-reflection" process that refines suggestion ranking and eliminates irrelevant or incorrect proposals.`
self_reflection 2024-09-15 14:47:27 +03:00			`This process consists of the following steps:`

			`1. Presenting the generated suggestions to the model in a follow-up call.`
self_reflection 2024-09-15 14:50:24 +03:00			`2. Instructing the model to score each suggestion on a scale of 0-10 and provide a rationale for the assigned score.`
self_reflection 2024-09-15 14:47:27 +03:00			`3. Utilizing these scores to re-rank the suggestions and filter out incorrect ones (with a score of 0).`
			`4. Optionally, filtering out all suggestions below a user-defined score threshold.`

			`Note that presenting all generated suggestions simultaneously provides the model with a comprehensive context, enabling it to make more informed decisions compared to evaluating each suggestion individually.`

Qodo Merge rename 2024-09-29 17:15:49 +03:00			`To conclude, the self-reflection process enables Qodo Merge to prioritize suggestions based on their importance, eliminate inaccurate or irrelevant proposals, and optionally exclude suggestions that fall below a specified threshold of significance.`
self_reflection 2024-09-15 14:50:24 +03:00			`This results in a more refined and valuable set of suggestions for the user, saving time and improving the overall experience.`
self_reflection 2024-09-15 14:47:27 +03:00
TLDR 2024-09-16 09:21:52 +03:00			`## Example Results`
self_reflection 2024-09-15 14:47:27 +03:00
			`![self_reflection](https://codium.ai/images/pr_agent/self_reflection1.png){width=768}`
			`![self_reflection](https://codium.ai/images/pr_agent/self_reflection2.png){width=768}`


TLDR 2024-09-16 09:21:52 +03:00			`## Appendix - Relevant Configuration Options`
self_reflection 2024-09-15 14:47:27 +03:00			```
			`[pr_code_suggestions]`
			`suggestions_score_threshold = 0 # Filter out suggestions with a score below this threshold (0-10)`
Format files by `pre-commit run -a` Signed-off-by: Yu Ishikawa <yu-iskw@users.noreply.github.com> 2024-10-30 09:56:03 +09:00			```