Helper script to populate reports to analyze code evolution over time based on Codee metrics #10

Open

tamara-laranga wants to merge 2 commits into main from add/multipleReportsAnalyzer

Collaborator

tamara-laranga commented Apr 6, 2026

Add a script that generates an HTML report from Codee HTML/JSON outputs. The report summarizes the evolution of Codee checkers across the codebase, enabling comparison of findings between runs and helping track improvements or regressions over time.

tamara-laranga requested a review from ulisescosti

April 6, 2026 10:16

tamara-laranga self-assigned this

ulisescosti reviewed

Collaborator

ulisescosti left a comment

Good to know that we’re finally about to have a post-processing script ready to be shared publicly!

I left some comments and suggestions in the review.

Additionally, I suggest adding a subdirectory under scripts, for example:
scripts/post_processing_codee_json_reports/

Outdated, resolved review threads: scripts/README.md (6), scripts/templates/codee_report.html (2)

tamara-laranga added 2 commits

April 7, 2026 09:43


Add scripts files 507d01b

Add README.md e3f687d

tamara-laranga force-pushed the add/multipleReportsAnalyzer branch from e77e133 to e3f687d Compare

April 7, 2026 07:43

tamara-laranga requested a review from ulisescosti

April 7, 2026 07:44

ulisescosti reviewed

Collaborator

ulisescosti left a comment

I've now taken a quick look at the Python script and left some comments and suggestions.

scripts/post_processing_codee_reports/analyze_codee_reports.py

+                  try:
+                      content = report_js_path.read_text(encoding="utf-8")
+                      match = re.search(r"const\s+report\s*=\s*(\{.*\});", content, re.DOTALL)

Collaborator

ulisescosti Apr 7, 2026

A comment explaining the purpose of these lines is missing here.
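As an illustration of the kind of comment being asked for, here is a minimal sketch of the same extraction step with its purpose spelled out. The helper name `extract_report_json` and the sample `report.js` content (including its keys) are invented for this example, and it assumes the embedded object literal is valid JSON.

```python
from __future__ import annotations

import json
import re

# Hypothetical sample of what a report.js file might contain (keys are made up).
SAMPLE_REPORT_JS = 'const report = {"checkers": [{"id": "X1", "count": 3}]};\n'


def extract_report_json(content: str) -> dict | None:
    """Return the object assigned to `const report`, or None if it is absent."""
    # report.js embeds the report data as JavaScript: `const report = {...};`.
    # The greedy group captures from the first "{" to the last "};" in the file,
    # and re.DOTALL lets "." match newlines because the object spans many lines.
    match = re.search(r"const\s+report\s*=\s*(\{.*\});", content, re.DOTALL)
    if match is None:
        return None
    # Assumption: the captured literal is plain JSON, not arbitrary JavaScript.
    return json.loads(match.group(1))


report = extract_report_json(SAMPLE_REPORT_JS)
```

If the literal ever contained JavaScript-only syntax (unquoted keys, trailing commas), `json.loads` would raise `json.JSONDecodeError`, so the caller would still need to handle that case.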

scripts/post_processing_codee_reports/analyze_codee_reports.py

+                      with open(file_path, "r", encoding="utf-8") as f:
+                          return json.load(f)
+                  except json.JSONDecodeError as e:
+                      logger.warning(f"Malformed JSON in {file_path.name}: {e}")

Collaborator

ulisescosti Apr 7, 2026

Suggested change

      
-                      logger.warning(f"Malformed JSON in {file_path.name}: {e}")
+                      logger.error(f"Malformed JSON in {file_path.name}: {e}")

If the issue clearly prevents the program from performing the given task, I think it should be an error message rather than a warning.

scripts/post_processing_codee_reports/analyze_codee_reports.py

+                      logger.warning(f"Malformed JSON in {file_path.name}: {e}")
+                      return None
+                  except OSError as e:
+                      logger.warning(f"Failed to read {file_path.name}: {e}")

Collaborator

ulisescosti Apr 7, 2026

Suggested change

      
-                      logger.warning(f"Failed to read {file_path.name}: {e}")
+                      logger.error(f"Failed to read {file_path.name}: {e}")
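With both suggestions applied, the loader could look roughly like the sketch below. The function name `load_json_report` and the demo files are assumptions made for illustration. Only the `try`/`except` body comes from the diff hunks above.

```python
from __future__ import annotations

import json
import logging
import tempfile
from pathlib import Path


def load_json_report(file_path: Path, logger: logging.Logger) -> dict | None:
    """Load one JSON report; log at ERROR level when the file is unusable."""
    try:
        with open(file_path, "r", encoding="utf-8") as f:
            return json.load(f)
    except json.JSONDecodeError as e:
        # The file exists but cannot be parsed, so this report is lost.
        logger.error(f"Malformed JSON in {file_path.name}: {e}")
        return None
    except OSError as e:
        # Covers missing files, permission problems and other I/O failures.
        logger.error(f"Failed to read {file_path.name}: {e}")
        return None


logging.basicConfig(level=logging.INFO)
demo_logger = logging.getLogger("codee_demo")  # hypothetical logger name

with tempfile.TemporaryDirectory() as tmp:
    good = Path(tmp) / "good.json"
    good.write_text('{"ok": true}', encoding="utf-8")
    bad = Path(tmp) / "bad.json"
    bad.write_text("{not json", encoding="utf-8")
    good_result = load_json_report(good, demo_logger)
    bad_result = load_json_report(bad, demo_logger)
    missing_result = load_json_report(Path(tmp) / "missing.json", demo_logger)
```

Whether a single unreadable file deserves `error` or `warning` depends on whether the run can still produce a useful summary from the remaining reports. The sketch follows the reviewer's preference.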

scripts/post_processing_codee_reports/analyze_codee_reports.py

+                  """Load a Codee HTML report by parsing report.js file."""
+                  report_js_path = report_dir / "report.js"
+                  if not report_js_path.exists():
+                      logger.warning(f"report.js not found in {report_dir.name}")

Collaborator

ulisescosti Apr 7, 2026

Suggested change

      
-                      logger.warning(f"report.js not found in {report_dir.name}")
+                      logger.error(f"report.js not found in {report_dir.name}")

scripts/post_processing_codee_reports/analyze_codee_reports.py

+              def load_html_report(report_dir: Path, logger: logging.Logger) -> dict | None:
+                  """Load a Codee HTML report by parsing report.js file."""
+                  report_js_path = report_dir / "report.js"
+                  if not report_js_path.exists():

Collaborator

ulisescosti Apr 7, 2026

Why isn't this conditional within the try-except block? It could also go in a separate try-except block.
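One way to fold the existence check into the exception handling is to drop `exists()` and catch `FileNotFoundError` directly, as in the sketch below. The helper name `read_report_js` is hypothetical and covers only the file-reading step, not the author's full `load_html_report`.

```python
from __future__ import annotations

import logging
import tempfile
from pathlib import Path


def read_report_js(report_dir: Path, logger: logging.Logger) -> str | None:
    """Return the text of report.js in report_dir, or None on failure."""
    report_js_path = report_dir / "report.js"
    try:
        # No separate exists() check: a missing file raises FileNotFoundError,
        # so the "not found" case is handled next to the other read errors.
        return report_js_path.read_text(encoding="utf-8")
    except FileNotFoundError:
        logger.error(f"report.js not found in {report_dir.name}")
    except OSError as e:
        logger.error(f"Failed to read {report_js_path.name}: {e}")
    return None


demo_logger = logging.getLogger("codee_demo")  # hypothetical logger name

with tempfile.TemporaryDirectory() as tmp:
    empty_dir = Path(tmp) / "run_without_report"
    empty_dir.mkdir()
    full_dir = Path(tmp) / "run_with_report"
    full_dir.mkdir()
    (full_dir / "report.js").write_text("const report = {};", encoding="utf-8")
    missing_text = read_report_js(empty_dir, demo_logger)
    found_text = read_report_js(full_dir, demo_logger)
```

Because `FileNotFoundError` is a subclass of `OSError`, it has to be listed first to keep its dedicated message. This form also avoids the gap between checking for the file and reading it.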

scripts/post_processing_codee_reports/analyze_codee_reports.py

+                          reports.append((timestamp, report, report_dir))
+                  elif input_type == "json":
+                      logger.info("Detected JSON format (recursive search for *.json)")

Collaborator

ulisescosti Apr 7, 2026

Suggested change

      
-                      logger.info("Detected JSON format (recursive search for *.json)")
+                      logger.info("Detected JSON format")

Same as above.

scripts/post_processing_codee_reports/analyze_codee_reports.py

+                          l_level_data[l_level].append(p_groups[l_level])
+                  # Build links to original reports (JSON or HTML)
+                  report_links: list[dict | None] = []

Collaborator

ulisescosti Apr 7, 2026

I think it would be valuable to leave a basic example of the data structure in comments. It would make the code easier to understand.
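A sketch of what such comments could look like follows. The actual shapes are not visible in this diff hunk, so every key, label and value below (`"L1"`, `"L2"`, `"label"`, `"href"`, the counts) is a guess for illustration only. Only the names `l_level_data`, `p_groups`, `l_level` and `report_links` come from the script.

```python
from __future__ import annotations

# Hypothetical input: one mapping per report, ordered oldest to newest.
# The level labels and counts are invented for this example.
per_report_groups = [
    {"L1": 4, "L2": 10},  # oldest report
    {"L1": 2, "L2": 11},  # newest report
]

# l_level_data maps each level to one value per report, oldest first, e.g.
#   {"L1": [4, 2], "L2": [10, 11]}
l_level_data: dict[str, list[int]] = {"L1": [], "L2": []}
for p_groups in per_report_groups:
    for l_level in l_level_data:
        l_level_data[l_level].append(p_groups[l_level])

# report_links holds one entry per report, or None when no link exists, e.g.
#   [{"label": "run-1", "href": "run-1/index.html"}, None]
report_links: list[dict | None] = [
    {"label": "run-1", "href": "run-1/index.html"},
    None,
]
```

A short literal like this next to each container lets a reader check index alignment between `l_level_data` and `report_links` without tracing the loops.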

scripts/post_processing_codee_reports/analyze_codee_reports.py

+              def main() -> int:
+                  parser = argparse.ArgumentParser(
+                      description="Analyze Codee JSON or HTML reports and generate HTML visualizations."

Collaborator

ulisescosti Apr 7, 2026

Suggested change

      
-                      description="Analyze Codee JSON or HTML reports and generate HTML visualizations."
+                      description="Parse Codee JSON or HTML reports and generate an HTML summary visualization."

scripts/post_processing_codee_reports/analyze_codee_reports.py

+                  parser.add_argument(
+                      "output_dir",
+                      type=Path,
+                      help="Directory where HTML report will be saved",

Collaborator

ulisescosti Apr 7, 2026

Suggested change

      
-                      help="Directory where HTML report will be saved",
+                      help="Directory where the HTML report will be saved",

scripts/post_processing_codee_reports/analyze_codee_reports.py

+                  reports = load_reports(args.input_dir, logger)
+                  if not reports:
+                      logger.warning("No valid reports loaded. Exiting.")

Collaborator

ulisescosti Apr 7, 2026

Suggested change

      
-                      logger.warning("No valid reports loaded. Exiting.")
+                      logger.error("No valid reports loaded. Exiting.")
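Taken together, the suggestions on `main()` might end up looking like the sketch below. It makes several assumptions: `load_reports` is replaced by a stub that finds nothing, `input_dir` is treated as a positional argument with an invented help string, the logger name is made up, and the non-zero return value of `1` is a choice made here, not something the diff shows.

```python
from __future__ import annotations

import argparse
import logging
from pathlib import Path


def load_reports(input_dir: Path, logger: logging.Logger) -> list:
    """Stand-in for the script's loader; this stub never finds a report."""
    return []


def main(argv: list[str] | None = None) -> int:
    parser = argparse.ArgumentParser(
        description="Parse Codee JSON or HTML reports and generate an HTML summary visualization."
    )
    # Assumed positional argument; its help text is invented for this sketch.
    parser.add_argument("input_dir", type=Path, help="Directory containing the Codee reports")
    parser.add_argument(
        "output_dir",
        type=Path,
        help="Directory where the HTML report will be saved",
    )
    args = parser.parse_args(argv)

    logger = logging.getLogger("analyze_codee_reports")  # hypothetical name
    reports = load_reports(args.input_dir, logger)
    if not reports:
        # Nothing to summarize: report an error and signal failure to the shell.
        logger.error("No valid reports loaded. Exiting.")
        return 1
    return 0


exit_code = main(["reports", "out"])
```

In the real script the entry point would typically be `raise SystemExit(main())`, so that a failed run is visible to CI jobs or shell scripts through the exit status.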

Collaborator

ulisescosti commented Apr 7, 2026

I've run the script with a sample input dataset, and it looks great!
I'll suggest updated color codes once Fani from Marketing shares the color palette. Aside from that, it’s an excellent basic demonstration of what users can do with Codee reports.

Regarding the UX, I would recommend including a solid example input dataset in the repository, along with the exact commands users need to follow to parse the data and generate the summary HTML report.

In my opinion, these example datasets should include at least five JSON reports and five HTML reports. What do you think?
