Release NotesEN April 30, 2026 3 min readvon Klara

BLXBench 0.8.5: Cleaner HTML Output for More Reliable AI Model Benchmarks

The latest BLXBench release fixes HTML artifact issues in visual previews, making AI model benchmarking workflows smoother and more reliable.

blxbenchreleaseai-benchmarkinghtmltui

If you've been using BLXBench to benchmark AI models and noticed occasional stray markdown fences or wrapper text appearing in your HTML visual previews, you're not alone. The 0.8.5 release, available today, addresses these exact pain points with targeted improvements to HTML artifact handling.

What Changed in 0.8.5

Two specific improvements make HTML output cleaner and more reliable:

1. HTML Artifact Cleanup in Coding/UI Previews

When a model wraps a single-file HTML result in a code block (like html), BLXBench used to display the markdown fences (` html and ``` `) as part of the visual preview. This was distracting and made it harder to focus on the actual rendered output.

Fix: The TUI now strips these code fences before rendering, so you see only the clean HTML output as intended.

2. Enhanced HTML Cleanup for Visual Uploads

Beyond just code fences, the visual preview system now more aggressively strips other markdown leftovers that could interfere with HTML rendering:

  • Loose code-fence labels
  • Stray first-line html tags
  • Other markdown artifacts that might appear alongside generated HTML

Result: Generated single-file HTML now renders cleanly in previews without wrapper text or visual noise beside the actual artifact.

Why This Matters for Your Workflow

These might seem like small fixes, but they directly impact the reliability of your AI model evaluation process:

  • Less visual noise: Cleaner previews mean faster interpretation of results
  • More accurate assessment: When what you see matches what the model actually produced, your benchmarks are more trustworthy
  • Smoother presentation: If you're sharing results with teammates or stakeholders, cleaner output looks more professional
  • Reduced cognitive load: Fewer distractions let you focus on comparing model performance rather than deciphering display artifacts

Technical Details

Under the hood, these improvements touch the HTML processing pipeline in both the TUI and web visualization components. The changes are intentionally narrow—focused only on cleaning up presentation-layer artifacts without altering the underlying benchmarking logic or model interaction protocols.

For teams using BLXBench in automated CI/CD pipelines or shared evaluation environments, this means more consistent visual output across different runs and environments.

Getting the Update

If you're already using BLXBench, you can update via your preferred package manager:

# npm
npm update -g @bitslix/blxbench@latest

# pnpm  
pnpm update -g @bitslix/blxbench@latest

# Bun
bun update -g @bitslix/blxbench@latest

Or download the latest platform-specific binaries from the BLXBench download page.

As always, we recommend reviewing the full changelog for complete details on this and previous releases.

Looking Ahead

The 0.8.5 release continues our focus on refining the user experience based on real-world usage patterns. If you've encountered other rough edges in your benchmarking workflows—whether in the TUI, web app, or reporting features—we'd love to hear about them. Your feedback helps us prioritize improvements that make BLXBench more useful for AI engineering teams everywhere.

Happy benchmarking!