First Production Release After Public Testing Phase [1.0.0]
First Major Release of the Production-Ready Product Following Completion of the Public Testing Phase
Features:
- Query Planner – RAG Retrieval Overhaul: A lightweight LLM call (gpt-5.4-mini) now runs before document retrieval to generate a cleaned, topic-focused search query, replacing the previous embedding-based topic-shift detection heuristic. Result: more accurate RAG retrieval and elimination of topic contamination (e.g., VPN and HPC being merged into a single incorrect query). Fully configurable at runtime through a new tab in the Env Manager, with a robust fallback ensuring that planner failures or timeouts never interrupt a chat session.
- Domain-Specific Search Language: HPC-/cluster-related queries are searched in English (matching the language of the knowledge base), while responses remain in the user’s preferred language.
- Explicit Language Switching: Instructions such as “answer in English” or “auf Deutsch bitte” now switch the response language reliably.
- More Accurate Incident Detection: Incident banners are displayed only when users explicitly ask about a disruption, rather than whenever a service is merely mentioned.
- Screenshot-Upload (Vision Support): Images can be attached via paste, file selection, or drag-and-drop. The client downsizes uploads to WebP, while the server validates and re-encodes image data using Pillow. Screenshots remain available throughout follow-up questions and are automatically removed after a configurable retention period.
- Dark Mode: New theme system with a dedicated dark stylesheet and theme toggle.
- Per-Message Test Cases and promptfoo Export: Administrators can mark individual question–answer turns as test cases, define the expected answer and correct documentation references, and export everything as promptfoo-compatible YAML files for automated quality evaluation.
Intern/Architektur
- The monolithic bot.py (775 lines) has been refactored into a structured app/bot/ package (pipeline, client, search, language, tokens, incidents, query_planner).
- routes.py has been modularized into subpackages under app/routes/.
- Integrated deployment webhook (seko_deploy_webhook), expanded test coverage, and CI/CD updates.
Bugfixes and Improvements:
- Source Attribution: Improved tracking of RAG source documents. Shared chats now display the referenced documentation under a dedicated Sources section.
- Feedback Review System Overhaul: Refactored review interface, introduced a flag queue for suspicious chats, and normalized the database schema using enums and lookup tables for feedback categories and AI models.
- Env Manager Redesign: Database-backed configuration management with bidirectional .env-synchronization, side-by-side configuration views, a dedicated write user, and export of merged .env files.
- Incident Processing: Improved prioritization by severity, time-based filtering, token-budget-aware truncation, and clickable incident links.
- Updated GPT Model Support: Token tracking and model routing updated for the latest GPT-5.x deployments.
- Evaluation API: New endpoint for automated evaluation runs.
Fix for Context Window Overflow in the Response Endpoint (v1.20.0)
Bugfixes and Improvements:
- The response endpoint now returns an error message when the context window is exceeded
Support for New Models and Enhancements for Reports, Pricing, and Token Limits (v1.20.0)
Features:
- Support for new models added: gpt-5.4 and gpt-5.5
- Reports can now be exported as JSON or CSV
- Tiered pricing based on input tokens is now supported
- Input tokens can now be limited separately per deployment for frontend and API
Bugfixes and Improvements:
- Image models now return an error message if generation fails due to the content filter
- The response endpoint now returns an error message when the context window is exceeded
ML compute quota now shown in GPU-h
We would like to inform users of the following changes to r_wlm_usage.
Users can now see that the r_wlm_usage and r_wlm_usage -q commands display different quotas for MPI (CPU Only) and ML (GPUs) projects. The r_wlm_usage table will display for the same project, two distinct sections as follows:
=== HPC partitions (CPU Core-h) ===
….
=== ML partitions (GPU-h) ===
….
These changes are mostly in how the values are displayed as GPU-h. The used quotas have not changed and we expect no negative effects to previous, current or future quota utilization. Already used and allocated quotas within ML (GPU) systems were converted with the pre-existing ratio of 24CPU-h per 1 GPU-h. This change is to help distinguish between CPU and GPU usage within the NHR directives.
Additionally the default GPU-h quota for the ‘default’ project of every account running on GPU nodes has been set to 10 GPU-h.
This means that users can use their ‘default’ CPU-h quota without using the GPU-h quota and vice versa. The same rules of over-quota apply to both types.
New Features in the asknet Software-Shop Interface
The interface of the Software Shop has been changed. All available items are now displayed directly on the start page. The previously necessary selection of a price group before ordering is therefore no longer required.
Price groups can still be conveniently selected using a filter on the left side. In addition, price groups with an Org-ID have been named according to their respective function to facilitate orientation.
If items from different price groups are ordered, the system automatically separates them in the shopping cart, for example for students, “IT-Bestellende” or in the future “B Produkt – MS Select Plus”.
Further information on IT Center Help
Mathematica 15 Not Yet Approved
A few days ago, the new version Mathematica 15 was released. A new feature of this version is the free access to Wolfram AI for registered users.
However, since the servers of this AI service are not located in Europe and its use has not yet been approved by RWTH Aachen University, we are currently investigating how to deactivate the service.
Until this review is completed, we will neither provide Mathematica 15 on the license server nor approve license requests for this version.
Bis auf Weiteres sollte Mathematica 14.3.0 oder eine ältere Version verwendet werden.
Fixed Shared Chat Permission Issue (v1.19.0)
Bugfixes and Improvements:
- An issue with permission checks for creating shared chats has been fixed.
Model Access Restored and Sharing Permissions Fixed (v1.19.0)
Bugfixes and Improvements:
- Some models that were temporarily unavailable for admins are now selectable again: glm-4.7, apertus-70b-instruct-2509, and deepseek-r1-distill-llama-70b.
- An issue with permission checks for creating shared chats has been fixed.
Firefox Replaced on Dialog Nodes
The Firefox web browser was uninstalled from all HPC cluster dialog nodes due to lacking security updates for the given system configuration. Moreover, Firefox was deployed the modules tree to maintain its availability until the update problems using the OS packages can be fully resolved. However, this strategy requires to load the “Firefox” module using the command line first and executing it then on the command line as well.
To workaround these limitations, we are providing a wrapper script and a desktop definition, such that you should be able to continue using Firefox seamlessly without any further actions required.
You can track any disruptions or security advisories that may occur due to the aforementioned change in the RWTH High Performance Computing (HPC) category on our status reporting portal.
Release Notes Version 2.54.0
Improvements and bug fixes:
- Minimum operating system version required to use the RWTHapp has been increased [iOS only]
- Various bug fixes

