AI in Counter-Drone Systems: From Detection to Neutralization

16 June 2026

1. From Detection to Decision: The Evolution of Counter-Drone Systems

Counter-drone capability is no longer a niche air-defence add-on. It is becoming a core layer of force protection, base defence, manoeuvre support, and critical-infrastructure resilience. Recent policy from the U.S. Department of Defense treats the rapid proliferation of unmanned systems as a strategic problem, not merely a tactical one, and links the threat directly to growing autonomy, AI, networking, and mass availability. In practice, that means decision-makers should stop asking whether AI belongs in counter-UAS and start asking where in the kill chain it delivers measurable advantage without creating unacceptable legal, cyber, or operational risk.

The strongest emerging design pattern is not “one better sensor” but a layered system-of-systems: radar for wide-area surveillance, RF/SIGINT for emissions-based early warning and attribution, EO/IR for recognition, acoustic sensing for close-range passive cueing, and AI-driven fusion to reduce false alarms, prioritize tracks, and compress operator workload. That architecture aligns with current Army sensor-integration efforts. For organizations building counter-drone capabilities, the implication is clear: the defensible value lies not in a single model, but in open integration, common data models, edge-ready inference, secure middleware, and verification pipelines that connect sensors, C2 workflows, and effectors into a functioning whole.

2. The Problem AI Must Solve

The problem statement is sharper than “detect the drone.” A defensible counter-drone AI stack must identify a small, low, slow, and often low-cost target in clutter; distinguish it from birds, friendly UAS, or civilian traffic; maintain track continuity under manoeuvre and intermittent observability; estimate intent and threat level; and support a lawful neutralization decision quickly enough to matter. The operational burden is compounded by the fact that many drones are cheap enough to be used in swarms or in repeated probing attacks, which puts enormous pressure on operator attention and on the cost-per-engagement equation.

That is why current defence thinking places increased emphasis on machine-speed decision support, passive and active defences, and layered architectures that can scale from installation protection to mobile formations. Army C-UAS experimentation now explicitly frames the requirement around integrating best-of-breed sensors, reducing cognitive load, and speeding decisions from human tempo toward machine tempo, while still keeping commanders and operators responsible for force application.

3. Sensing Modalities and Multi-Sensor Fusion

No single sensor closes the counter-drone problem. Recent reviews and programme evidence converge on the same point: radar, RF, EO/IR, acoustic, and passive sensing each solve different parts of detection, classification, and localization, and each fails under different conditions. Radar remains the backbone for all-weather surveillance and early track generation. EO/IR remains the strongest route to visual confirmation and forensic-quality evidence. RF and SIGINT layers can classify protocols, identify emitters, or exploit Remote ID and telemetry when they are present. Acoustic sensing adds a cheap passive layer at shorter range, especially in the last hundreds of metres. The result is a strong bias toward fused architectures rather than monolithic point solutions.

The state of the art is moving from simple sensor “stacking” to explicit fusion at different levels. Pereira et al. (2024) compare pixel-level and decision-level EO/IR fusion around a YOLOv7-plus-ByteTrack pipeline. Arapoglou et al. (2025) describe hierarchical multi-sensor threat detection and decision-making. More recent anti-UAV work also divides fusion into early/data-level, feature-level, and late/decision-level approaches, with growing interest in hierarchical combinations that preserve robustness when one modality degrades. The practical lesson for procurement is straightforward: ask not only whether a vendor fuses sensors, but where fusion occurs, what timing assumptions it needs, how it degrades when one modality drops out, and how outputs are exposed to C2.
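To make the "how does it degrade when one modality drops out" question concrete, the sketch below shows one simple form of late (decision-level) fusion. It is an illustration only: the modality names, the weights, and the `fuse_late` helper are assumptions made for this example and are not taken from the cited papers or from any fielded system.

```python
from typing import Dict, Optional

# Hypothetical per-modality trust weights; real values would come from validation data.
WEIGHTS: Dict[str, float] = {"radar": 0.35, "rf": 0.25, "eo_ir": 0.30, "acoustic": 0.10}


def fuse_late(scores: Dict[str, Optional[float]]) -> Optional[float]:
    """Weighted average of per-sensor drone probabilities.

    A value of None means the modality is unavailable (dropout, jamming,
    weather). Weights are renormalized over the sensors that did report,
    so losing one modality changes the evidence but does not break the output.
    """
    available = {m: s for m, s in scores.items() if s is not None and m in WEIGHTS}
    if not available:
        return None  # no evidence at all: the caller should fall back, not guess
    total_weight = sum(WEIGHTS[m] for m in available)
    return sum(WEIGHTS[m] * s for m, s in available.items()) / total_weight


all_sensors = fuse_late({"radar": 0.8, "rf": 0.9, "eo_ir": 0.7, "acoustic": 0.4})
no_rf = fuse_late({"radar": 0.8, "rf": None, "eo_ir": 0.7, "acoustic": 0.4})
print(round(all_sensors, 3), round(no_rf, 3))
```

In this toy case an RF-silent target still produces a fused score from the remaining sensors. Early or feature-level fusion would instead combine raw data or embeddings before classification, which usually demands tighter time synchronization between sensors.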

3.1 Sensor comparison

Modality	Indicative range	Practical resolution and identification value	Strengths	Main limitations	Typical cost and integration complexity
Radar	Roughly 2-5+ km for many small-UAS use cases	Good range and velocity; some systems support micro-Doppler cues for class discrimination	All-weather, day/night, wide-area search, fast track initiation	Small RCS targets, clutter, multipath, false alarms without fusion	Medium to high
Acoustic	Roughly 50-200 m in noisy settings; farther in quiet environments	Good bearing with arrays; poor direct ranging unless fused	Passive, low cost, useful for close-in cueing and redundancy	Noise, wind, urban masking, limited reach	Low to medium
EO/IR	Roughly 0.5-2+ km for practical recognition, optics-dependent	Very high angular detail; strongest for confirmation and BDA	Positive ID, visual evidence, day/night with thermal	Weather, haze, camouflage, occlusion, weak native depth	Medium
RF detection and Remote ID exploitation	Roughly 1-3+ km for common control and telemetry links; farther when Remote ID conditions are favorable	Strong protocol and device discrimination; coarse geolocation unless multi-node	Fast early warning when the target emits; low collateral burden	Fails against RF-silent, autonomous, or fiber-linked drones	Low to medium
SIGINT and passive RF geolocation	Highly emitter- and geometry-dependent; often km-scale LOS coverage	Can support attribution, emitter characterization, and multi-node geolocation	Valuable for intent inference and network-level picture	Not all threats emit; requires timing, baselining, and spectrum expertise	Medium to high

The ranges above are indicative, not procurement specifications. They synthesize representative values from recent reviews and exemplar systems: NATO multistatic radar work reports drone-detection ranges up to 5 km, RF-based studies report strong performance past 2-3 km for emitting targets, EO/IR effectiveness is highly optics- and cueing-dependent, and acoustic systems can collapse to roughly 50-200 m in noisy environments even when they remain valuable as a passive confirmation layer. Cost and complexity are inferential, based on hardware, calibration, synchronization, and network-integration demands rather than a single official price baseline.

4. AI Models Across the Counter-Drone Workflow

The model landscape is already specialized by function. CNN-style detectors and YOLO-family models still dominate real-time EO/IR detection because they fit strict latency budgets. Sequence models are increasingly used to suppress hard false positives such as birds or clutter trajectories. Akyon et al. (2022) show 3D CNN, LSTM, and transformer-style sequence classifiers for drone-vs-bird discrimination. Pereira et al. (2024) pair YOLOv7 with ByteTrack. CVPR Anti-UAV benchmark results in 2025 show that the most competitive trackers are still hybrid systems, blending learned detection with motion-aware association rather than relying on “pure AI” end-to-end pipelines.

Fusion models are also maturing. Recent work spans multimodal transformers for radar-acoustic-video fusion, hierarchical visible/infrared fusion, RF open-set recognition models, and graph-based anomaly detection over flight telemetry. Dong et al. (2025) identify multimodal fusion, self-supervision, adversarially oriented benchmarks, and synthetic-data generation as the main frontier areas. Feng et al. (2025) push anomaly detection toward causality-enhanced graph neural networks, which is especially relevant for identifying abnormal flight behaviour, spoofing effects, or mission-profile deviations that a single image frame cannot reveal. MMAUD (2024) matters here because it provides a rare public benchmark with stereo vision, LiDAR, radar, audio arrays, and accurate ground truth for detection, classification, and trajectory estimation.

In operational terms, the workflow is best thought of as four linked AI functions rather than one monolithic “autonomous” block:

Detection and cueing: radar, RF, SIGINT, acoustic, or wide-FOV video flag candidate objects and hand them to higher-cost recognition models.
Classification and identification: CNNs, spectrogram classifiers, sequence models, and multimodal transformers distinguish hostile drones from birds, friendly UAS, or benign aerial objects.
Tracking and intent estimation: trackers such as ByteTrack, adaptive Kalman variants, and motion-association logic preserve continuity through occlusion, target loss, or erratic manoeuvre.
Neutralization support: threat-ranking and policy engines recommend options such as monitoring, handoff, soft-kill, or hard-kill, but the decision should remain bounded by rules-of-engagement, legal review, airspace deconfliction, and system state confidence.
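The tracking function in the list above can be illustrated with a one-dimensional alpha-beta filter, a fixed-gain simplification of the Kalman filter. This is a sketch under stated assumptions, not ByteTrack or an adaptive Kalman variant: the class name, the gain values, and the measurements are all hypothetical, and a real tracker would work in two or three dimensions with tuned noise models.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class AlphaBetaTrack:
    """1-D constant-velocity track with fixed gains (alpha-beta filter)."""
    position: float        # metres
    velocity: float        # metres per second
    alpha: float = 0.85    # position gain (assumed value)
    beta: float = 0.005    # velocity gain (assumed value)
    missed: int = 0        # consecutive updates without a measurement

    def step(self, dt: float, measurement: Optional[float]) -> float:
        # Predict forward under a constant-velocity assumption.
        predicted = self.position + self.velocity * dt
        if measurement is None:
            # Occlusion or sensor dropout: coast on the prediction and count the miss.
            self.position = predicted
            self.missed += 1
            return self.position
        # Correct the prediction using the measurement residual.
        residual = measurement - predicted
        self.position = predicted + self.alpha * residual
        self.velocity += (self.beta / dt) * residual
        self.missed = 0
        return self.position


track = AlphaBetaTrack(position=0.0, velocity=10.0)
history = []
for z in [10.2, 19.9, None, None, 50.1]:  # two missed detections mid-track
    track.step(1.0, z)
    history.append((round(track.position, 2), track.missed))
print(history)
```

The `missed` counter is the hook for track management: a tracker would typically drop a track once it has coasted for longer than some validated limit.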

5. Edge AI, Cybersecurity, and Adversarial Robustness

Edge deployment is where many promising demos fail. Recent studies on edge AI in defence systems point this out directly: counter-drone systems often need to run on mobile surveillance platforms at the edge, where compute, memory, power, and cooling are constrained. In military settings, those constraints sit on top of denied, degraded, intermittent, and low-bandwidth networking, so offloading everything to a remote cloud is often unrealistic. The right design response is not “bigger model, bigger GPU,” but model partitioning, selective inferencing, hardware-aware compression, graceful degradation, and a clear separation between edge-critical tasks and rear-echelon analytics.
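Selective inferencing can be sketched as a two-stage cascade: a cheap scorer runs on every frame, and the expensive classifier runs only on the best candidates within a fixed compute budget. Everything named here (`selective_inference`, the `gate` and `budget` parameters, the stub scoring functions) is a hypothetical illustration rather than a description of any particular product.

```python
from typing import Callable, Dict, List, Tuple

Frame = Dict[str, float]  # placeholder for an image or sensor frame (assumption)


def selective_inference(
    frames: List[Frame],
    cheap_score: Callable[[Frame], float],
    heavy_classify: Callable[[Frame], str],
    gate: float = 0.3,
    budget: int = 2,
) -> List[Tuple[int, str]]:
    """Run the heavy model only on the highest-scoring frames that pass the gate."""
    scored = [(cheap_score(f), i) for i, f in enumerate(frames)]
    # Keep frames above the gate, best first, and cap them at the compute budget.
    selected = sorted((s, i) for s, i in scored if s >= gate)[::-1][:budget]
    # Return results in frame order so downstream tracking sees a consistent sequence.
    return [(i, heavy_classify(frames[i])) for _, i in sorted(selected, key=lambda c: c[1])]


frames = [{"motion": 0.1}, {"motion": 0.9}, {"motion": 0.5}, {"motion": 0.7}]
calls: List[Frame] = []


def stub_heavy(frame: Frame) -> str:
    calls.append(frame)  # record how often the expensive path actually runs
    return "drone" if frame["motion"] > 0.6 else "bird"


results = selective_inference(frames, lambda f: f["motion"], stub_heavy)
print(results, len(calls))
```

The budget is what makes degradation graceful: under load the system classifies fewer candidates instead of missing its latency deadline on all of them.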

Cybersecurity has to cover the full AI-and-sensor lifecycle. The NIST 2025 adversarial machine-learning taxonomy explicitly frames attacks across model methods, lifecycle stages, attacker goals, and attacker knowledge. The DoD’s 2025 AI cybersecurity tailoring guide likewise argues that cyber risk management must be integrated from the start of the AI lifecycle, not bolted on after model training. For counter-drone systems, that means protecting sensor firmware, timing and PNT, RF ingest, message brokers, feature stores, model artifacts, signed updates, and effector interfaces as one attack surface.
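As one small piece of that attack surface, the sketch below checks a model artifact against an integrity tag before it is loaded. It uses HMAC-SHA256 from the Python standard library as a stand-in; genuinely signed updates would normally rely on asymmetric signatures and managed keys, which are outside this example. The key, the artifact bytes, and the function names are placeholders.

```python
import hashlib
import hmac


def artifact_tag(key: bytes, artifact: bytes) -> str:
    """Compute an HMAC-SHA256 tag over the raw bytes of a model artifact."""
    return hmac.new(key, artifact, hashlib.sha256).hexdigest()


def verify_artifact(key: bytes, artifact: bytes, expected_tag: str) -> bool:
    """Return True only if the artifact matches the tag (constant-time comparison)."""
    return hmac.compare_digest(artifact_tag(key, artifact), expected_tag)


key = b"demo-key-not-for-production"   # placeholder; real keys belong in an HSM or KMS
model_bytes = b"model-weights-v1"      # stands in for the serialized model file
tag = artifact_tag(key, model_bytes)

print(verify_artifact(key, model_bytes, tag))                # untouched artifact
print(verify_artifact(key, model_bytes + b"tampered", tag))  # modified artifact
```

The same check-before-load pattern applies to sensor firmware and configuration bundles; the point is that the edge node refuses anything it cannot verify.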

Operational robustness also has a policy dimension. NATO’s revised AI strategy and related certification work place lawfulness, responsibility, explainability, reliability, governability, and bias mitigation at the centre of defence AI. For counter-UAS, that translates into auditable operator displays, confidence-aware recommendations, known fallback modes, and the ability to disengage or revert when the system drifts outside validated operating conditions. In other words: a system that cannot explain why it recommends jamming or firing is not mature enough for serious deployment, regardless of benchmark accuracy.
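A confidence-aware recommendation with a known fallback mode might look like the following sketch. The thresholds, option names, and the `recommend` function are assumptions for illustration; the code only produces a recommendation with its reasons attached, and the engagement decision stays with the operator.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class Option(Enum):
    MONITOR = "monitor"
    HANDOFF = "hand off to operator"
    RECOMMEND_SOFT_KILL = "recommend soft-kill (operator approval required)"


@dataclass
class Recommendation:
    option: Option
    reasons: List[str]  # shown to the operator and written to the audit trail


def recommend(threat_score: float, confidence: float, in_validated_envelope: bool,
              min_confidence: float = 0.8, threat_threshold: float = 0.7) -> Recommendation:
    reasons = [f"threat_score={threat_score:.2f}", f"confidence={confidence:.2f}"]
    if not in_validated_envelope:
        # Fallback mode: outside tested conditions the system stops ranking options.
        reasons.append("outside validated operating conditions")
        return Recommendation(Option.HANDOFF, reasons)
    if confidence < min_confidence:
        reasons.append(f"confidence below {min_confidence:.2f}")
        return Recommendation(Option.HANDOFF, reasons)
    if threat_score >= threat_threshold:
        reasons.append(f"threat_score at or above {threat_threshold:.2f}")
        return Recommendation(Option.RECOMMEND_SOFT_KILL, reasons)
    reasons.append("threat below threshold")
    return Recommendation(Option.MONITOR, reasons)


print(recommend(0.9, 0.95, True))
print(recommend(0.9, 0.95, False))
```

Returning the reasons alongside the option is the design choice that matters here: the same structure can drive the operator display and the audit record.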

6. C2 Integration and Rules of Engagement

AI does not replace the C2 stack; it becomes a decision-support layer inside a broader C4ISR architecture. Current Army integration work is instructive here. Integrated Sensor Architecture is explicitly designed to let sensors from different manufacturers interoperate through common standards, reduce translation bottlenecks, and lower latency at the tactical edge. NGC2 (Next Generation Command and Control), in turn, is explicitly data-centric, cloud-native, and built around open architectures. As these architectures shorten the path from sensor data to engagement options, DoD Directive 3000.09 becomes especially relevant: it requires appropriate levels of human judgment over the use of force, alongside rigorous legal review, testing, and cybersecurity safeguards.

This matters acutely for electronic attack. A useful Polish-language reminder comes from the Polish Civil Aviation Authority’s GNSS interference seminar, which highlights how even anti-drone jamming incidents can produce wider aviation-side effects on navigation and surveillance environments. For system architects, that means soft-kill chains must be airspace-aware, spectrum-managed, geofenced, and fully logged. In business terms, buyers should prioritize traceable policy engines and authority management just as highly as raw sensor performance.
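The "geofenced and fully logged" requirement can be sketched as a gate that checks a proposed jamming location against exclusion zones and records every decision, permitted or not. The zone name, coordinates, radius, and function names below are hypothetical, and a real system would also need spectrum and airspace coordination that this example does not model.

```python
import math
from datetime import datetime, timezone
from typing import Dict, List, Tuple

Zone = Tuple[str, float, float, float]  # name, latitude, longitude, radius in metres


def haversine_m(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in metres on a spherical Earth (radius 6,371 km)."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * 6_371_000.0 * math.asin(math.sqrt(a))


def soft_kill_permitted(lat: float, lon: float, zones: List[Zone],
                        audit_log: List[Dict]) -> bool:
    """Deny soft-kill inside any exclusion zone and log the decision either way."""
    blocking = [name for name, zlat, zlon, radius in zones
                if haversine_m(lat, lon, zlat, zlon) <= radius]
    permitted = not blocking
    audit_log.append({
        "time_utc": datetime.now(timezone.utc).isoformat(),
        "target": (lat, lon),
        "permitted": permitted,
        "blocking_zones": blocking,
    })
    return permitted


zones = [("hypothetical_approach_corridor", 52.0, 21.0, 5000.0)]
log: List[Dict] = []
near = soft_kill_permitted(52.01, 21.0, zones, log)  # about 1.1 km from the zone centre
far = soft_kill_permitted(52.10, 21.0, zones, log)   # about 11 km from the zone centre
print(near, far, len(log))
```

A permitted result here is only a precondition. It does not replace operator authority or the rules-of-engagement checks described above.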

7. Testing, Validation, and Operational Lessons

Testing has to move well beyond static accuracy scores. The Chief Digital and Artificial Intelligence Office test-and-evaluation frameworks emphasize lifecycle T&E and operational realism; their core message is that justified confidence comes from testing AI-enabled capabilities under the complexities of real use, not from isolated lab metrics alone. Standardized counter-drone evaluation work is pushing the same direction: detection, tracking, and identification should be measured separately, under different weather, background clutter, target classes, false-positive tolerances, and decision-latency constraints.
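Measuring performance separately per condition, rather than as one pooled score, can be sketched as follows. The record format, the condition labels, and the sample data are invented for illustration; a real evaluation would also separate tracking and identification metrics and add decision latency.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

# (condition, ground truth is a drone, system predicted a drone)
Record = Tuple[str, bool, bool]


def stratified_metrics(records: Iterable[Record]) -> Dict[str, Dict[str, float]]:
    """Recall and false-alarm rate computed independently for each test condition."""
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "fn": 0, "tn": 0})
    for condition, truth, predicted in records:
        if predicted:
            key = "tp" if truth else "fp"
        else:
            key = "fn" if truth else "tn"
        counts[condition][key] += 1
    report = {}
    for condition, c in counts.items():
        positives = c["tp"] + c["fn"]
        negatives = c["fp"] + c["tn"]
        report[condition] = {
            "recall": c["tp"] / positives if positives else float("nan"),
            "false_alarm_rate": c["fp"] / negatives if negatives else float("nan"),
        }
    return report


records = [
    ("clear", True, True), ("clear", True, True), ("clear", False, False), ("clear", False, False),
    ("fog", True, True), ("fog", True, False), ("fog", False, True), ("fog", False, False),
]
report = stratified_metrics(records)
print(report)
```

On this made-up data the pooled recall would be 0.75, which hides the fact that recall in fog is only 0.5. That gap is the kind of finding lifecycle T&E is meant to surface.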

Datasets and simulation are central because truly representative hostile-drone data are hard to collect. Public resources such as the Anti-UAV challenge, drone-vs-bird datasets, and MMAUD are increasingly important because they expose models to small-object, infrared, multimodal, and trajectory-estimation problems. But dataset work alone is insufficient. Teams need sim-to-real pipelines, red-teaming, replay environments, and cyber-range-style exercises that include spoofing, RF noise, degraded networks, operator overload, and sensor dropout. That is consistent both with NATO’s use of cyber range and simulation for realistic training and with current anti-UAV research trends toward synthetic data and adversarial benchmarking.

Operational examples reinforce the point. NATO’s 2023 and 2024 counter-drone exercises have emphasized interoperability, while Ukrainian participation in the 2024 C-UAS TIE explicitly connected allied experimentation to battlefield lessons on drone autonomy and interoperability. The U.S. Army’s 2025 Project Flytrap 4.5 series tested detect-discriminate-defeat products against simulated drone threats in NATO airspace and framed the exercise as a coalition environment for passive and active sensors, defeat options, data flow, and interoperability. Separately, recent Army C5ISR work on FoCUS shows the value of modular, government-owned software that integrates multiple sensing modalities into a single platform, reduces cognitive load, and can be fielded across echelons. These are strong signals for buyers: insist on experimentation in realistic networks and coalition contexts, not just demo-day drone shots against a blue sky.

8. Conclusion: Integration Is the Real Advantage

The future of counter-drone systems will not be decided by a single breakthrough model or sensor. It will be shaped by the ability to integrate detection, classification, tracking, and decision-making into a coherent, reliable, and secure system. Organizations that invest only in point solutions will face fragmentation, latency, and operational risk. Those that focus on integration, data consistency, and system-level design will gain a decisive advantage – not just in detection, but in actionable decision-making.

For defence stakeholders, the key question is no longer whether AI works. It is whether it is deployed in a way that is interoperable, explainable, and operationally reliable.

At Transition Technologies MS, we focus on building exactly these kinds of integrated, mission-ready systems – connecting sensors, AI models, and command layers into a unified operational environment. Learn more about our capabilities at TTMS Defence.

What is adversarial machine learning and why does it matter in defence systems?

Adversarial machine learning refers to techniques used to manipulate or deceive AI models by altering input data in subtle ways. In the context of counter-drone systems, this could mean tricking a detection model into misclassifying a drone as a harmless object or failing to detect it altogether.

This is particularly important in defence because AI systems operate in contested environments where adversaries actively attempt to disrupt or exploit them. Standards and frameworks developed by organizations such as NIST emphasize that security must be considered across the entire AI lifecycle – from data collection and model training to deployment and updates.

In practice, this means counter-drone systems must be designed to remain reliable even when inputs are noisy, incomplete, or intentionally manipulated.

What does “edge deployment” mean in military AI systems?

Edge deployment means running AI models directly on local devices – such as sensors, vehicles, or portable systems – rather than relying on centralized cloud infrastructure. This is critical in military environments where connectivity may be limited, unreliable, or intentionally disrupted.

For counter-drone systems, edge AI allows real-time detection and response without depending on external networks. However, it also introduces constraints related to processing power, memory, and energy consumption.

To address this, engineers use techniques such as model optimization, compression, and selective inference to ensure that AI systems remain both efficient and effective in field conditions.

What are RF, SIGINT, EO/IR, and acoustic sensors in drone detection?

These terms refer to different types of sensors used in counter-drone systems:

RF (Radio Frequency) sensors detect communication signals between a drone and its operator.
SIGINT (Signals Intelligence) expands on RF by analyzing and interpreting electronic signals for identification and attribution.
EO/IR (Electro-Optical / Infrared) sensors use visual and thermal imaging to detect and identify objects.
Acoustic sensors detect the sound signatures produced by drone motors and propellers.

Each of these sensors has strengths and limitations. For example, RF detection works well when a drone is actively communicating, while EO/IR provides visual confirmation. Modern systems combine multiple sensor types to improve accuracy and reliability.

What are YOLO models and pipelines like YOLOv7 + ByteTrack?

YOLO (You Only Look Once) is a family of real-time object detection models widely used in computer vision. These models are designed to identify objects in images or video streams quickly, making them suitable for time-sensitive applications such as drone detection.

A pipeline such as YOLOv7 combined with ByteTrack integrates detection and tracking. YOLOv7 identifies objects frame by frame, while ByteTrack maintains continuity by tracking those objects across multiple frames.

This combination allows systems not only to detect a drone but also to follow its movement over time, which is essential for threat assessment and response.
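The tracking half of such a pipeline rests on associating new detections with existing tracks. The sketch below shows a greedy intersection-over-union (IoU) matcher as a simplified illustration of that idea. It is not ByteTrack itself, which additionally uses Kalman-filter motion prediction and a second association pass for low-confidence detections; the function names, threshold, and boxes are assumptions for this example.

```python
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]  # x1, y1, x2, y2 in pixels


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def associate(tracks: Dict[int, Box], detections: List[Box],
              min_iou: float = 0.3) -> Dict[int, int]:
    """Greedily match track IDs to detection indices, best overlap first."""
    pairs = sorted(
        ((iou(box, det), tid, di)
         for tid, box in tracks.items()
         for di, det in enumerate(detections)),
        reverse=True,
    )
    matches: Dict[int, int] = {}
    used = set()
    for score, tid, di in pairs:
        if score < min_iou:
            break  # remaining pairs overlap too little to be the same object
        if tid in matches or di in used:
            continue
        matches[tid] = di
        used.add(di)
    return matches


tracks = {1: (0.0, 0.0, 10.0, 10.0), 2: (50.0, 50.0, 60.0, 60.0)}
detections = [(51.0, 51.0, 61.0, 61.0), (1.0, 1.0, 11.0, 11.0), (100.0, 100.0, 110.0, 110.0)]
matches = associate(tracks, detections)
print(matches)
```

The unmatched third detection would start a new tentative track, while a track that goes unmatched for too long would be dropped.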

What is C4ISR / NGC2 and why is it important for counter-drone systems?

C4ISR stands for Command, Control, Communications, Computers, Intelligence, Surveillance, and Reconnaissance. It refers to the integrated systems that collect data, process it, and support decision-making in military operations.

NGC2 (Next Generation Command and Control) is a modern approach to C2 that emphasizes data-centric architectures, interoperability, and cloud-native design. It enables faster and more informed decision-making by connecting multiple data sources into a unified operational picture.

In counter-drone systems, this integration is critical. Detection alone is not enough – data must be combined, interpreted, and translated into actionable decisions within a broader operational context.

What is MMAUD and why are datasets important in counter-drone AI?

MMAUD is an example of a multimodal dataset used in anti-drone research. It combines data from multiple sensor types, such as video, radar, and audio, to support the development and evaluation of detection and tracking models.

Datasets like MMAUD are essential because they allow engineers to train and test AI systems under realistic conditions. However, collecting real-world data for hostile drone scenarios is difficult, which is why simulation and synthetic data are often used alongside real datasets.

The quality and diversity of training data directly impact how well a system performs in real operational environments.

TTMS blog – the world through the eyes of IT experts

NIS2 Cybersecurity in Pharma: Requirements, Obligations, and Implementation in 2026

NIS2 cybersecurity in pharma is an operational resilience requirement, not a stand-alone IT project. A cyber incident can stop a filling line, isolate a laboratory, interrupt a cold chain, corrupt a clinical dataset or make a validated system unavailable. Each outcome can affect product quality, patient safety and continuity of supply.

Directive (EU) 2022/2555, known as NIS2, creates a common EU baseline for cybersecurity risk management, management oversight and significant-incident reporting. The legal duty is implemented through national law. A company must therefore read the Directive together with the rules, thresholds, registration procedures and competent-authority guidance in every Member State where it falls within scope.

This guide converts the legal baseline into actions and evidence for pharmaceutical manufacturers, biotechnology companies, medicinal-product R&D organisations, contract manufacturing organisations (CMOs), contract research organisations (CROs) and their critical suppliers. It also explains where NIS2 must be aligned with GxP, Computerized System Validation (CSV), Computer Software Assurance (CSA), GAMP 5 and existing quality-management processes.

This article covers pharma-specific implementation. For the detailed evidence model, see the TTMS NIS2 compliance documentation and evidence checklist.

1. Why pharmaceutical operations are a priority cyber target under NIS2

NIS2 places the manufacture of basic pharmaceutical products and pharmaceutical preparations within the health sector in Annex I, alongside healthcare providers, EU reference laboratories and entities carrying out research and development of medicinal products. That classification reflects systemic impact: disruption can affect access to medicines and public-health response, not only one company’s balance sheet. The threat picture supports that treatment.
ENISA reported that, among health-related incidents analysed for its 2024 threat landscape, 45% involved ransomware and 28% involved data breaches. A separate commercial dataset counted 4,198 ransomware cases exposed on dark-web leak sites across all sectors in the first half of 2025, 49% more than in the comparable 2024 dataset. The 4,198 figure is not pharma-specific, so it should not be presented as a count of attacks on pharmaceutical or biotechnology organisations.

Pharma combines assets that create leverage for attackers: intellectual property, clinical and patient-related data, regulated production, scarce batches, time-sensitive logistics and a broad supplier network. The same identity platform, integration layer or remote-maintenance channel may connect corporate IT with ERP, MES, LIMS, ELN, EDC and operational technology (OT). An attacker does not need to compromise every system. Disrupting one shared dependency may be enough to stop release, testing or distribution.

Treat the business impact as a chain. Map each critical product or service to facilities, processes, systems, data, utilities, people and third parties. Record the maximum tolerable outage and the quality consequences of data loss or delayed review. That service map becomes evidence for risk analysis, business continuity, recovery priorities and supply-chain decisions.

2. NIS2 in life sciences: scope, classification and legal status

NIS2 expanded the EU cybersecurity baseline beyond the narrower NIS1 model. It applies, as a rule, to medium-sized and large entities of a type listed in Annex I or Annex II, subject to specific inclusions and exceptions. In life sciences, the legal analysis must start with what the entity actually does, not the brand description “pharma”, “biotech” or “healthcare”. Activities may include medicinal-product R&D, API or finished-product manufacture, device manufacture, clinical operations, distribution, marketing, digital services or combinations of them.
A group can contain entities with different statuses. A CMO or CRO is not automatically in or out merely because of its label. The relevant activity, size, establishment, jurisdiction and any national designation must be documented.

2.1 From NIS1 to NIS2: what changed for health and pharma

NIS2 widens sector coverage, standardises a minimum set of cybersecurity risk-management measures, sets a staged significant-incident reporting model and strengthens supervision and enforcement. It requires management bodies to approve risk-management measures, oversee implementation and receive training. It also requires Member States to maintain national cybersecurity strategies and incident-response structures.

The result is a common baseline, not identical administration across the EU. Registration, thresholds, forms, competent authorities, language, audit expectations and sanctions are implemented nationally. In July 2026, the Commission referred Ireland, Spain, France and the Netherlands to the Court of Justice for failing to notify full transposition. Cross-border groups still need a jurisdiction register and local legal verification.

Existing GMP and quality-management governance can provide a starting structure. Management review, change control, deviation management, CAPA, supplier qualification, training and periodic review already create owners and records. Extend those processes to cybersecurity; do not assume that GxP evidence automatically proves NIS2 compliance.

2.2 Essential or important entity? Classify before selecting controls

Under Article 3, an Annex I entity that exceeds the ceiling for a medium-sized enterprise is generally an essential entity. Other medium-sized entities within Annex I or Annex II are generally important entities, unless a specific rule or national designation changes the result. Certain entity types are essential regardless of size.
Micro and small enterprises are generally excluded, but Article 2 contains exceptions based on criticality and other factors. Pure distribution or marketing activity may fall outside the listed pharma categories when the entity performs no covered activity and is not designated on another basis. Conversely, an organisation conducting medicinal-product R&D can fall within Annex I even if it does not manufacture. Medical-device coverage also requires careful reading of the relevant Annex category; not every device business has the same classification.

Create a signed scope memorandum for each legal entity. Include activities, NACE or equivalent classification, headcount and financial data, establishments, services, national rules, group dependencies and the reason for the conclusion. Record who approved it and when it must be reviewed. This memorandum is the first auditable artefact; a product brochure or a group-level assumption is not enough.

3. Four compliance pillars for pharmaceutical organisations

Organise NIS2 around four connected pillars: risk management, significant-incident reporting, management accountability and supply-chain security. Each needs an owner, a procedure and operating evidence.

3.1 Article 21 risk management: ten minimum areas

Article 21 requires appropriate and proportionate technical, operational and organisational measures based on an all-hazards approach. The ten minimum areas below should be mapped to services and risks, not treated as a generic tool-purchasing list.

Article 21 area	Pharma implementation focus	Typical audit evidence
1. Risk analysis and information-system security policies	Link product, patient and service impact to IT, OT and GxP systems	Approved method, service map, risk register, treatment decisions
2. Incident handling	Coordinate security, quality, privacy, legal, production and communications	Incident plan, severity matrix, case records, after-action reports
3. Business continuity, backup, disaster recovery and crisis management	Prioritise batch, laboratory, release and cold-chain dependencies	BIA, RTO/RPO, recovery plans, restore tests, exercise reports
4. Supply-chain security	Assess API, CMO, CRO, logistics, cloud and maintenance dependencies	Supplier tiering, due diligence, contracts, monitoring, exit plans
5. Secure acquisition, development and maintenance, including vulnerability handling and disclosure	Connect security changes to validated-state and change-control decisions	Security requirements, threat models, vulnerability records, change packages
6. Assessment of control effectiveness	Test design, coverage and operating results	Control tests, metrics, internal audits, CAPA and closure evidence
7. Cyber hygiene and training	Train by role, including engineers, laboratory staff and management	Curricula, attendance, competence checks, phishing or exercise results
8. Cryptography and encryption	Protect data and communications while managing keys and certificates	Cryptography standard, key inventory, certificate monitoring, exceptions
9. HR security, access control and asset management	Control joiners, movers, leavers, privileged access and system ownership	Asset register, access reviews, PAM records, segregation-of-duties evidence
10. MFA or continuous authentication and secure communications	Cover remote access, privileged actions and exposed services based on risk	MFA coverage, exception register, secure-channel configuration and reviews

Build requirements traceability between each NIS2 measure, the service risk, the control, the system owner and the evidence source. Existing GxP processes can carry part of the load. Vulnerability remediation can use change control; control testing can align with periodic review and CSA; security training can use the controlled learning system. The mapping must also expose gaps. A validated application with no tested recovery process remains a continuity risk.

3.2 Article 23 reporting: 24 hours, 72 hours and one month

For a significant incident, Article 23 establishes staged reporting: an early warning without undue delay and within 24 hours after becoming aware; an incident notification without undue delay and within 72 hours; and a final report no later than one month after the incident notification. Intermediate or progress reports may also be required. If the incident is ongoing at the one-month point, a progress report replaces the final report and the final report follows within one month after handling ends.

The clock starts from awareness, not from completion of a forensic investigation. Define who can declare awareness, who assesses significance, who contacts the national CSIRT or competent authority and who coordinates parallel duties under GDPR, sector rules, contracts and, where relevant, medical-device obligations. Preserve both the decision to report and a reasoned decision not to report.

Real-time visibility across identity, network, endpoint, cloud, ERP, MES, LIMS, ELN, EDC and OT improves the chance of meeting the timetable. A central SIEM can support detection and chronology, but it does not make a legal significance assessment. Use a human-in-the-loop process with on-call authority, a current contact list, pre-approved templates and a decision log.

In validated environments, deploy monitoring through approved change control. Passive OT monitoring, network telemetry and controlled log forwarding may reduce interference with production assets. Test the entire route in a tabletop exercise: alert, technical triage, quality impact, legal assessment, management escalation, authority submission and follow-up.

3.3 Article 20: management responsibility and board-level evidence

Management bodies must approve the Article 21 measures, oversee implementation and can be held liable for infringements under national law. Members must follow training, and Member States must encourage regular training for employees.
Evidence should show informed oversight, not a ceremonial annual presentation. Provide the board with decisions it can act on: top service risks, overdue high-risk treatments, control effectiveness, significant incidents, recovery-test failures, critical supplier exposure, material exceptions and required investment. Retain agendas, papers, minutes, approvals, challenge and follow-up. Record training content, attendance and an effectiveness check.

The Directive also allows competent authorities, in specified circumstances concerning essential entities, to request temporary suspension of a certification or authorisation and a temporary prohibition on certain senior managers exercising managerial functions until deficiencies are remedied. This is a supervisory measure with conditions, not an automatic personal ban after every incident. Avoid overstating it as criminal liability.

3.4 Article 21(2)(d): API, CMO, CRO and logistics risk

Map suppliers to the services and products they can affect. Include API and excipient suppliers, CMOs, CROs, testing laboratories, packaging, cold-chain logistics, cloud platforms, managed services, equipment vendors, remote maintenance and single-source technology dependencies. Tier suppliers using impact, access, substitutability, concentration and recovery time.

Due diligence should test the evidence relevant to the service: control scope, incident history, privileged access, subcontractors, vulnerability handling, backup and recovery, secure development, geographic concentration and exit feasibility. A questionnaire is a declaration; a certificate has value only after its scope, exclusions and period are checked.

Contracts should define minimum controls, incident-notification timing, cooperation, audit or assurance rights, vulnerability handling, subcontractor conditions, data return, continuity and exit. Contract language does not replace monitoring.
Record reviews, adverse findings, risk acceptance, compensating controls, owners and expiry dates.

4. Pharma-specific cybersecurity challenges NIS2 does not solve by itself

NIS2 states outcomes and minimum risk areas. It does not prescribe how to patch a validated MES, monitor a PLC in a clean manufacturing area or preserve ALCOA+ principles during a cyber response. These decisions require security, quality, engineering and regulatory roles to work from one risk record.

4.1 Secure validated systems without losing validated state

A security patch or configuration change can affect the validated state of MES, LIMS, QMS, chromatography, environmental-monitoring or other GxP systems. Delaying every patch is unsafe; applying every patch without assessment is also unsafe. The control objective is a documented, risk-based decision.

Connect vulnerability management to change control. Record asset and version, vulnerability severity, exploitability, patient or product impact, exposure, vendor support, proposed change, test scope, rollback, compensating controls and approval. Use GAMP 5 and CSV or CSA principles to scale assurance to the risk of the changed function. Re-test what can affect intended use, data integrity, electronic records, interfaces and critical calculations.

Maintain validated state throughout the lifecycle. Periodic review should reconcile configuration, deviations, patches, access, backup, audit trails, incidents and supplier changes. Emergency changes need predefined authority and retrospective quality review. Evidence should make the sequence traceable from threat to decision, test, release and post-implementation monitoring.

4.2 Protect clinical-trial data, IP and patient-related information

NIS2 covers entities carrying out R&D activities of medicinal products when the scope and size rules are met.
Their risk model must protect availability, authenticity, integrity and confidentiality across protocol design, investigator sites, eCOA, EDC, safety systems, biostatistics, regulatory submissions and partner exchanges.

Apply ALCOA+ data-integrity thinking: records should remain attributable, legible, contemporaneous, original, accurate, complete, consistent, enduring and available. Cyber controls must protect the audit trail and the context required to interpret data. Detect bulk data exports, unusual privileged activity, manipulation and unauthorised interface changes. Test restoration of both data and metadata.

Privacy belongs in a coordinated but distinct assessment. A single event can create a NIS2 significant-incident question and a GDPR personal-data-breach question with different tests, recipients and deadlines. Maintain one fact base and timeline, then run separate legal decision paths.

4.3 IT/OT convergence in manufacturing and clean areas

OT assets often have long lifecycles, vendor constraints, deterministic communications and limited maintenance windows. Standard endpoint agents may be unsupported. A production pause can itself create quality and supply consequences. Treat OT as a distinct engineering risk domain connected to enterprise governance.

Begin with passive discovery and verified ownership. Define zones and conduits, restrict remote access, separate safety and control functions from business networks, protect engineering workstations, monitor allowed communications and control removable media. Use compensating controls when patching is not feasible. Confirm that segmentation and fail-safe behaviour do not disrupt real-time control or environmental conditions.

Every change should have cyber, automation and quality acceptance criteria. Test during approved windows, document rollback and retain configuration baselines.
The evidence package should include current diagrams, firewall rules, remote-access reviews, alert handling, backup or configuration-restore tests and approved exceptions.

5. NIS2, GDPR, MDR and quality systems: one management model

NIS2 protects the resilience and security of network and information systems. GDPR protects personal data and creates breach-notification duties. MDR and IVDR govern medical devices and include safety, quality and post-market obligations. GMP and GxP govern product quality and data integrity. One incident can activate several regimes, but the legal tests are not interchangeable.

Build one management model with multiple compliance mappings. Use a common service catalogue, asset register, risk method, incident record, supplier register, training process, CAPA workflow and evidence index. Map each control to the applicable NIS2 article, national law, GDPR requirement, quality procedure and device obligation. This reduces duplicate evidence without collapsing distinct decisions.

Create a regulatory decision matrix before an incident occurs. For each regime, record the trigger, decision owner, recipient, deadline, minimum content and rule for follow-up. Add contractual notifications and communications to investigators, insurers, partners and affected customers. During an incident, one coordination lead should maintain the verified facts, while qualified owners make the separate legal and quality decisions. This model reduces contradictory reporting without allowing the shortest deadline to erase the distinct tests applied by each regime.

ISO/IEC 27001 can provide a useful information-security management structure; it does not by itself prove NIS2 scope, registration or national reporting compliance. ISO/IEC 42001 can support governance where AI is used in LIMS analytics, quality review or security operations, but AI controls still require validation, data-integrity assessment and human oversight appropriate to the use case.
Design an integrated incident form with separate sections for service impact, product and patient impact, personal data, regulatory status, notification decisions and communications. The same verified timeline can support the CSIRT, data-protection authority, quality unit and management without creating contradictory versions.

6. Penalties and enforcement: the cost of non-compliance

Article 34 requires Member States to provide maximum administrative fines for essential entities of at least EUR 10 million or at least 2% of worldwide annual turnover in the preceding financial year, whichever is higher. For important entities, the corresponding levels are at least EUR 7 million or 1.4%, whichever is higher. National law determines the applicable enforcement process and may set higher maximums or additional measures.

Fines are only one exposure. A cyber incident can generate lost sales, scrapped batches, delayed trials, recovery costs, contractual claims, privacy consequences and loss of confidence. Merck reported that its 2017 network attack disrupted manufacturing, research and sales, reduced 2017 sales by approximately USD 260 million and generated USD 285 million of manufacturing and remediation expense net of stated insurance recoveries; residual backlog affected 2018 sales by approximately USD 150 million.

Do not justify controls only by comparing programme cost with the statutory maximum. Prioritise by service impact, credible threat, control weakness and legal duty. The board should see both compliance exposure and the operational loss scenario for each critical product or service.

7. A 9-12 month NIS2 implementation roadmap for pharma

A 9-12 month programme can organise remediation, but it is not a legal grace period. Organisations already subject to national implementing law must meet current duties while improving maturity. Sequence work around critical risk and approved change windows in validated environments.
7.1 Step 1: scope and gap analysis

Confirm each legal entity’s status and jurisdiction. Inventory critical services and products, then map IT, OT, laboratory, clinical, data, facility, people and supplier dependencies. Assess the ten Article 21 areas and national obligations.

The assessment should produce an approved scope memorandum, jurisdiction register, service and dependency map, asset baseline, gap report, risk-ranked remediation plan and evidence index. Escalate any unknown externally exposed asset or unsupported critical system immediately.

7.2 Step 2: governance and accountability

Assign executive sponsorship, service owners, control owners and an incident-reporting authority. Define RACI across security, IT, OT, engineering, quality, privacy, legal, procurement, HR, communications and business continuity.

At this stage, the organisation should have a governance charter, RACI, management reporting pack, risk-acceptance thresholds, training plan, CSIRT contact matrix and defined authority for isolating production or laboratory systems.

7.3 Step 3: technical and organisational controls

Prioritise identity, privileged access, MFA, network segmentation, secure remote access, EDR where supported, passive OT monitoring, central logging, vulnerability management, protected backups and recovery. Connect each change to quality and validation procedures.

Completion is evidenced by approved architectures, control requirements, implementation records, validation or assurance evidence, coverage metrics, an exception register and tested rollback. Measure the population covered, not only whether a tool was purchased.

7.4 Step 4: supplier verification and continuous monitoring

Tier API, CMO, CRO, laboratory, logistics, cloud, software and maintenance suppliers. Run due diligence proportional to access and impact. Remediate contracts and establish monitoring triggers.
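
Supplier tiering can be sketched as a simple scoring function over the five criteria named in section 3.4 (impact, access, substitutability, concentration and recovery time). Everything else here is an assumption made for illustration: the 1-3 rating scale, the equal weighting, the cut-off values and the example suppliers are hypothetical and would need calibration against the organisation's own risk method.

```python
from dataclasses import dataclass


@dataclass
class Supplier:
    """Ratings use an assumed 1 (low) to 3 (high) scale."""
    name: str
    impact: int            # effect on product or service if the supplier fails
    access: int            # privileged or network access to internal systems
    substitutability: int  # 1 = easy to replace, 3 = single source
    concentration: int     # share of dependencies placed on this supplier
    recovery_time: int     # 1 = hours, 3 = weeks to restore or replace


def tier(supplier: Supplier) -> int:
    """Return tier 1 (critical), 2 or 3 using illustrative cut-offs."""
    score = (
        supplier.impact
        + supplier.access
        + supplier.substitutability
        + supplier.concentration
        + supplier.recovery_time
    )
    # Assumption: maximum impact alone is enough to make a supplier tier 1.
    if supplier.impact == 3 or score >= 12:
        return 1
    if score >= 8:
        return 2
    return 3


# Hypothetical suppliers for demonstration.
suppliers = [
    Supplier("Single-source API manufacturer", 3, 1, 3, 2, 3),
    Supplier("Cloud document platform", 2, 2, 2, 2, 2),
    Supplier("Courier for non-GxP documents", 1, 1, 1, 1, 1),
]

tiers = {s.name: tier(s) for s in suppliers}
for name, value in tiers.items():
    print(f"Tier {value}: {name}")
```

A score of this kind only orders the due-diligence effort; the risk decision, contract remediation and monitoring triggers still need a named owner.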
The operational output is a maintained supplier register supported by a criticality model, evidence reviews, risk decisions, security clauses, incident contacts, a monitoring schedule, concentration analysis and exit plans. Reassess after a material change or incident.

7.5 Step 5: build and test incident response

Create playbooks for ransomware, data exfiltration, validated-system compromise, OT disruption, supplier incident and loss of a critical cloud service. Include quality and regulatory decisions, not only technical containment.

The response capability should be documented in an incident plan, 24/72-hour and final-report templates, a significance assessment, an evidence-preservation method and a tabletop report. Run the exercise with executives and on-call personnel. Track corrective actions to verified closure.

7.6 Step 6: document, audit and sustain

Convert control operation into evidence by design. Automate controlled reports where practical, identify record owners and set retention based on national law, sector duties, investigation needs and risk. Review the programme after incidents, major changes and legal updates.

The programme closes with a controlled policy set, evidence index, management minutes, training records, incident and supplier files, recovery-test results, effectiveness testing, an internal-audit report and a CAPA register. Independent review should confirm closure of high-risk findings.

8. Documented cyber incidents: practical NIS2 lessons

Public incident reports rarely prove which internal control failed. Use them to test plausible scenarios, not to accuse an organisation of a control deficiency that has not been established.

Merck’s 2017 attack demonstrates that enterprise malware can reach manufacturing, research, sales and fulfilment at the same time. The NIS2 lesson is to map shared dependencies, segment environments, protect recovery capabilities and quantify product-level continuity.
Exercise the decision to isolate a plant system when isolation may interrupt production.

In the 2020 cyberattack on the European Medicines Agency, documents related to COVID-19 medicines and vaccines were unlawfully accessed. EMA reported that some leaked material, including correspondence, had been manipulated before publication. The lesson is broader than confidentiality: protect authenticity, integrity and provenance across regulator and partner exchanges, and prepare communications for manipulated or incomplete data.

Cencora disclosed in February 2024 that data had been exfiltrated from its information systems and might contain personal information. It stated at the time that operations remained functional and that containment, investigation, law-enforcement engagement and external support had begun. The lesson is to maintain rapid cross-functional triage even when availability is not affected: exfiltration can still trigger NIS2, privacy, contractual and trust decisions.

For each scenario, retain the alert timeline, affected services, evidence sources, quality assessment, reporting decision, management escalation and corrective actions. Link lessons to Article 21 controls and test whether the same evidence could support the 24-hour early warning.

9. Selecting expert support for NIS2 implementation

A pharma NIS2 partner must combine cybersecurity, regulated quality and implementation capability. Ask for evidence that the team can classify scope, map services, design IT/OT controls, manage validated change, build CSV or CSA evidence, assess suppliers, run incident exercises and explain residual risk to management.

Evaluate the delivery model. A one-time gap report does not sustain compliance. Managed services can operate monitoring, vulnerability triage, evidence collection and supplier review, but accountability remains with the regulated organisation and its management. Define ownership, escalation, service levels, evidence access and exit from the start.
Request sample deliverables before selection: a redacted scope memorandum, an Article 21 traceability matrix, a validated change package, an OT risk assessment, a supplier finding and an executive incident exercise report. Check whether conclusions identify assumptions, evidence and residual risk. Confirm that security specialists can work with quality, automation and legal teams, and that records can be transferred into the organisation’s controlled repositories. The partner should leave the organisation with an operating process and usable evidence, not a slide deck that cannot be maintained.

TTMS combines an ISO/IEC 27001 information-security management environment with pharmaceutical computerized-system validation services aligned to GAMP 5 and Annex 11. Its published quality offering covers CSV and CSA across the system lifecycle. In February 2026, TTMS reported becoming the first Polish company to obtain accredited ISO/IEC 42001 certification for its AI management system after an audit by TÜV Nord Poland. These credentials are relevant where cyber controls, validated systems and governed AI must remain auditable in one operating model.

To arrange a scoping call focused on legal entities, regulated services, critical products, validated systems, OT dependencies and current evidence, contact TTMS. The first output should be a defensible scope and prioritised action plan, not a generic control catalogue.

10. Frequently asked questions about NIS2 cybersecurity in pharma

Does NIS2 apply to every pharmaceutical company?

No. Scope depends on activity, size, establishment, national law and designation. Manufacturing and medicinal-product R&D are listed; marketing or distribution alone may lead to a different result. Document the conclusion for each legal entity.

Is every pharmaceutical manufacturer an essential entity?

No. Annex I classification does not automatically make every manufacturer essential.
Size thresholds, Article 3 rules, exceptions and national decisions determine whether an organisation is essential, important or outside scope. Group companies may reach different conclusions.

What are the main NIS2 incident-reporting deadlines?

For a significant incident, the Directive sets an early warning within 24 hours of awareness, an incident notification within 72 hours and a final report within one month. National procedures and parallel duties under GDPR or sector rules must also be checked.

How do NIS2, GxP and Annex 11 interact in pharmaceutical environments?

NIS2 governs cyber risk and resilience; GxP and Annex 11 govern product quality, data integrity and computerized systems. Use one risk and change-control model while preserving separate legal assessments and validation evidence for security changes.

How should security patches be handled in validated GxP systems?

Route the vulnerability through risk assessment and controlled change. Document exploitability, product or patient impact, test scope, rollback and compensating controls. Apply CSV or CSA assurance proportionate to the affected function and retain traceability from the vulnerability to approval and post-change review.
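
The staged reporting deadlines summarised in the FAQ can be sketched as a small calculation from the moment of awareness. This is an illustration under stated assumptions, not legal advice: the function names are hypothetical, "one month" is interpreted here as one calendar month clamped to the last day of the target month, and national law or authority guidance may define the period differently.

```python
import calendar
from datetime import datetime, timedelta, timezone


def add_one_month(moment: datetime) -> datetime:
    """Add one calendar month, clamping to the last day of the target month."""
    year = moment.year + (1 if moment.month == 12 else 0)
    month = 1 if moment.month == 12 else moment.month + 1
    last_day = calendar.monthrange(year, month)[1]
    return moment.replace(year=year, month=month, day=min(moment.day, last_day))


def early_deadlines(aware_at: datetime) -> dict[str, datetime]:
    """Latest times for the early warning and the incident notification."""
    return {
        "early_warning": aware_at + timedelta(hours=24),
        "incident_notification": aware_at + timedelta(hours=72),
    }


def final_report_deadline(notification_submitted_at: datetime) -> datetime:
    """Final report runs from the incident notification, not from awareness."""
    return add_one_month(notification_submitted_at)


# Hypothetical timeline for demonstration.
aware_at = datetime(2026, 1, 30, 9, 15, tzinfo=timezone.utc)
notified_at = datetime(2026, 1, 31, 16, 0, tzinfo=timezone.utc)

deadlines = early_deadlines(aware_at)
final_due = final_report_deadline(notified_at)

for label, due in deadlines.items():
    print(f"{label}: {due.isoformat()}")
print(f"final_report: {final_due.isoformat()}")
```

Both limits are outer bounds; the Directive also requires each step "without undue delay", so the decision log should record when awareness was declared and by whom.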

LXP vs LMS: Which Platform Wins in 2026?

LMS and LXP platforms solve different learning challenges. An LMS is designed to manage, deliver, and track structured training, while an LXP focuses on personalized, learner-driven learning and continuous skill development. Many organizations don’t choose one over the other. Instead, they use both to support different learning objectives.

Choosing between them can feel a bit like deciding between a library and a streaming service. One organizes learning in a structured way, while the other helps people discover relevant content based on their interests, goals, and previous activity. It’s a simple comparison, but it captures why the LMS vs LXP discussion continues to shape corporate learning strategies.

From our experience working with enterprise learning programs, one of the most common misconceptions is that an LXP is simply a newer version of an LMS. In reality, the two platforms serve different purposes. Organizations that see the best learning outcomes typically treat them as complementary technologies, using each where it delivers the greatest value.

Understanding those differences is essential before investing in a learning platform. The right choice depends not only on the features you need today but also on how your organization plans to develop skills, manage compliance training, and support continuous learning over time.

1. LXP vs LMS: Understanding the Core Difference Before You Choose

Who actually drives the learning experience? With an LMS, the organization does. Administrators design structured courses, assign them to learners, and track completion. With an LXP, the learner takes ownership. The platform surfaces relevant content, suggests next steps, and encourages exploration.

Think of an LMS as a formal curriculum and an LXP as a personalized learning feed. Neither is inherently superior. What matters is whether the platform fits your learning strategy, your workforce profile, and the outcomes you’re actually trying to drive.
That distinction also shapes how your L&D team operates, how your IT infrastructure connects, and how your employees feel about learning at work.

2. What Is an LMS? Purpose, Features, and Best-Fit Use Cases

A Learning Management System is the backbone of corporate training in most organizations. It centralizes, delivers, and tracks formal learning, particularly in environments where consistency and compliance aren’t optional. Onboarding new hires and certifying staff in regulated industries are two of its most common applications, and in both cases the LMS provides the structure that keeps programs running reliably at scale.

2.1 How an LMS Structures and Delivers Learning

An LMS organizes content into predefined courses and learning paths. Learners receive assignments, complete modules in sequence, pass assessments, and receive certificates or completion records. Everyone in a given role or department ends up meeting the same standard.

This works well when the goal is measurable competency. A new safety technician needs to complete specific modules before working on-site. A financial advisor must pass compliance training before advising clients. The LMS produces a clear, documented trail of who learned what and when, which is often a legal requirement rather than just an internal preference.

2.2 Core LMS Features That Drive Compliance and Administration

A strong LMS is built around control, structure, and governance. It helps administrators track completion rates, assessment results, certification status, and mandatory training progress without digging through separate files or manual reports. It also supports role-based enrolment, automated reminders, and audit-ready documentation, which is why LMS platforms remain essential in regulated sectors such as healthcare, finance, manufacturing, and aviation.

The problem starts when organizations expect an LMS to create the whole learning experience.
Most LMS platforms are not designed to spark curiosity, recommend content based on individual goals, or make learning feel self-directed. They are excellent at answering the question: “Has this person completed the required training?” They are usually weaker at answering: “What should this person learn next to grow in their role?” That is the gap an LXP is designed to fill.

3. What Is an LXP? Purpose, Features, and Best-Fit Use Cases

A Learning Experience Platform puts learners at the center. Rather than assigning fixed courses, an LXP pulls content from multiple sources, curates it based on individual preferences and goals, and surfaces what’s most relevant to each person. It ends up feeling more like a professional development hub than a training portal.

3.1 How an LXP Personalizes and Surfaces Learning

Personalization in an LXP relies on AI and machine learning to analyze how each learner interacts with the platform: what topics they engage with, what skills they’ve listed, what their peers in similar roles explore. A software engineer who watches content on cloud architecture will see more relevant resources appear in their feed. A marketing manager who finishes a course on data analytics might get suggestions on audience segmentation or attribution modeling. That kind of timely relevance is what keeps learning from feeling static.

The results are measurable. 88% of LXP users agree that an LXP provides a better learning experience than a traditional LMS, and 58% of HR leaders report improved training ROI through AI-curated learning journeys, which is the core capability LXPs are built around.

3.2 Core LXP Features That Drive Engagement and Discovery

An LXP is strongest when learning is not limited to assigned courses. It helps employees discover relevant content, follow their interests, and learn from people inside the organization.
Instead of relying only on a fixed training catalogue, an LXP can bring together content from internal knowledge bases, external providers, videos, podcasts, articles, and expert recommendations. Social learning features add another layer: employees can recommend resources, comment on materials, share achievements, and learn from colleagues who face similar challenges.

This is where an LXP becomes more than a content library. With user-generated content, internal subject matter experts can contribute practical knowledge from real projects, customer cases, tools, or processes. From our experience, this often makes the platform more valuable than a polished but generic course catalogue because employees trust knowledge that comes from people who understand their daily work.

The limitation is compliance. If every employee must complete a specific data privacy course by a regulatory deadline, an LXP alone is usually not enough. It may help people discover useful learning, but it does not give administrators the same level of tracking, audit readiness, or enforcement as an LMS.

An LXP also needs the right learning culture. If employees see training only as a mandatory task, recommendation engines and social learning features will not create engagement by themselves. In that case, an LXP works best when supported by clear learning paths, manager involvement, and LMS-style structure.

4. LXP vs LMS: Side-by-Side Comparison

When comparing LMS and LXP platforms directly, four dimensions reveal the most meaningful differences.

In an LMS, administrators own the content entirely. They create, approve, and manage every piece of material learners encounter. An LXP opens that up to multiple contributors, including learners and internal experts, but doing that well requires a governance strategy to keep quality from slipping.

Control also works differently in each system. Administrators in an LMS define learning paths, set deadlines, and decide what’s available to whom.
In an LXP, learners build their own playlists and search topics that interest them, finding their own way through available content.

On reporting, LMS platforms generate detailed audit logs and the documentation compliance officers need during inspections. LXP analytics focus on engagement, content popularity, and skill progression. That data is genuinely useful for L&D strategy, but it doesn’t replace compliance-grade reporting.

Integration priorities differ too. An LMS typically connects with HRIS systems, SSO providers, and payroll platforms. An LXP tends to offer broader connectivity with external content libraries, collaboration tools, and skills databases, increasingly linking learning activity to performance management and career development.

5. How to Choose Between an LXP and LMS for Your Organization

There’s no universal answer. The right choice depends on your workforce, your industry, your culture, and what you’re ultimately trying to achieve. In our experience helping organizations across healthcare, financial services, and technology evaluate platforms, the compliance question almost always comes first. Everything else tends to follow from there.

An LMS is the right fit when compliance, standardization, and accountability are the primary goals. Healthcare providers certifying staff on patient safety protocols, financial institutions managing mandatory regulatory training, and any organization where incomplete training carries legal or operational consequences should build their learning infrastructure around a well-built LMS.

An LXP suits organizations that want to build a learning culture rather than simply manage a training program. Companies in technology, creative industries, and professional services often find their workforce learns best through discovery, peer recommendation, and self-directed exploration. An LXP also works well for organizations trying to retain high performers by investing visibly in their career development.
5.1 When You Need Both: The Hybrid Approach

70% of new enterprise learning contracts now specify an LXP component, which reflects how commonly organizations are choosing to run both platforms rather than picking one. The two serve genuinely different purposes, and combining them creates a more complete learning setup than either alone.

In a hybrid model, the LMS handles mandatory and compliance-driven training with the rigor and documentation that requires. The LXP sits alongside it, giving employees space to explore voluntary learning, develop skills beyond their current role, and engage with content from diverse sources.

A practical example: a 1,500-person financial services organization arrived at a hybrid approach after realizing their compliance certification was well-managed in an LMS, but their technology and operations teams had no structured path for continuous upskilling. By integrating an LXP alongside the existing LMS and connecting both to a shared skills framework, they could enforce regulatory deadlines through the LMS while giving employees a self-directed track for career development. The L&D team gained a unified view of both mandatory completions and voluntary engagement, which made it possible to have more informed conversations about skill gaps at the team level.

This integrated approach works particularly well in mid-to-large organizations carrying both compliance responsibilities and genuine ambitions around building a stronger learning culture.

6. How AI Is Reshaping LXP and LMS Platforms in 2026

AI is no longer a future feature in learning platforms. It’s already changing how both LMS and LXP systems work. In LXP systems, AI drives the core personalization engine, making content recommendations sharper and more contextually relevant as the system learns more about each user.
In LMS platforms, AI is changing the administrative side: automated tagging reduces manual cataloging work, adaptive assessments adjust difficulty based on performance, and predictive analytics can flag learners at risk of missing compliance deadlines before those problems escalate.

At TTMS, we help organizations work through this shift in practice. That means evaluating existing learning infrastructure, identifying where AI adds genuine value, and integrating both platforms into a broader IT setup. The most common mistake we see is organizations deploying an LXP without a minimum content governance framework in place first. Without that structure, user-generated content can erode platform trust quickly, and the self-directed learning culture the LXP was meant to build never really takes hold.

7. The Verdict: Which Platform Wins in 2026?

Neither platform is the clear winner, and that is the most practical answer. An LMS is still the stronger choice for structured, compliance-driven training, especially in regulated industries where tracking, reporting, and certification management are non-negotiable. An LXP solves a different problem. It supports discovery, personalization, and continuous skill development in ways a traditional LMS was not designed to deliver.

The important shift heading into 2026 is that the line between LMS and LXP platforms is becoming less rigid. AI is making LMS systems more adaptive, while LXP platforms are adding more structure around learning paths, reporting, and compliance support. Vendors are also building tighter integrations and, in some cases, offering combined environments that bring both approaches together.

For most organizations, the right decision starts with clarity. Define the learning outcomes you need to achieve, understand what keeps your employees engaged, and assess your compliance requirements honestly. Then choose the platform, or combination of platforms, that matches those realities.
The best learning platform is not the newest one. It is the one that fits the work your organization actually needs learning to support.

| If your organization needs… | Choose | Why? |
| --- | --- | --- |
| Mandatory training and regulatory compliance | LMS | Provides structured training management, certification tracking, reporting, and audit-ready documentation. |
| Employee onboarding | LMS | Delivers standardized learning paths and ensures every new employee completes the required training. |
| Continuous employee upskilling | LXP | Recommends personalized learning content based on individual skills, interests, and career goals. |
| Building a learning culture | LXP | Encourages self-directed learning, knowledge sharing, and ongoing professional development. |
| Compliance training in regulated industries | LMS | Offers robust reporting, certification management, and compliance monitoring. |
| Career development and skills growth | LXP | Helps employees develop new capabilities through personalized recommendations and learning journeys. |
| Leveraging internal expert knowledge | LXP | Makes it easy for subject matter experts to create and share valuable organizational knowledge. |
| Managing both compliance and continuous learning | LMS + LXP | Combining both platforms provides structured compliance management while supporting personalized employee development. |

FAQ

What is the difference between an LMS and an LXP?

An LMS (Learning Management System) is designed to deliver, manage, and track structured training programs. It is commonly used for onboarding, compliance training, certifications, and mandatory learning. An LXP (Learning Experience Platform) focuses on personalized, learner-driven development. It recommends relevant content based on each employee’s skills, interests, and learning goals, helping support continuous learning beyond required courses.

When should an organization choose an LMS?

An LMS is the right choice when training must be standardized, assigned, and documented.
It is particularly valuable for organizations operating in regulated industries where compliance, certifications, reporting, and audit-ready records are essential. Healthcare, financial services, manufacturing, and aviation are common examples. When is an LXP a better option? An LXP is best suited for organizations that want to encourage continuous learning and employee development. It works particularly well when employees are expected to build new skills independently, access learning from multiple sources, and receive personalized recommendations based on their interests and career goals. Can an LMS and an LXP work together? Yes. Many organizations use both platforms as part of the same learning ecosystem. The LMS manages mandatory training, compliance, and certifications, while the LXP supports self-directed learning, knowledge sharing, and continuous skills development. Together, they provide a more complete learning experience than either platform alone. Can an LXP replace an LMS? In most cases, no. While an LXP offers a better experience for personalized learning, it typically lacks the governance, reporting, certification management, and compliance capabilities required for mandatory corporate training. Organizations with regulatory obligations usually continue to rely on an LMS while adding an LXP to support employee development. How is AI changing LMS and LXP platforms? Artificial intelligence enhances both platforms in different ways. In LMS platforms, AI automates tasks such as content tagging, adaptive assessments, reporting, and predictive analytics. In LXP platforms, AI improves personalization by recommending learning content based on each employee’s role, behavior, interests, and skills. The greatest value comes from combining AI with high-quality, well-governed learning content. Which platform is better for compliance training? 
An LMS is the better choice for compliance training because it provides structured learning paths, completion tracking, certification management, automated reminders, and audit-ready reporting. These capabilities help organizations demonstrate compliance with internal policies and external regulations. How do you choose the right learning platform? The right choice depends on your organization’s goals. If your priority is regulatory compliance and standardized training, an LMS is usually the best option. If your goal is to build a culture of continuous learning and personalized employee development, an LXP may be a better fit. Many organizations achieve the best results by combining both platforms to support different learning objectives.

AI End-to-End Testing: Complete Guide for 2026

Software testing has never been more demanding. Applications are larger, release cycles shorter, and user expectations higher than ever. QA teams are under pressure to validate complex workflows across layered tech stacks, often while fighting fires caused by tests that break the moment a developer pushes a UI update. AI end-to-end testing is changing that dynamic in a meaningful way, not by patching over old problems, but by rethinking how testing works from the ground up. This guide covers everything from why traditional automation falls short to how AI-driven platforms deliver faster, more resilient testing without inflating your team’s workload. 1. Why End-to-End Testing Becomes Unmanageable Without AI Modern web applications are more complex than ever. A single user journey often spans multiple screens, integrations, business processes, and application components. Testing these workflows manually is time-consuming, while traditional script-based automation can become difficult to maintain as applications evolve. The deeper issue is that testing complexity grows faster than QA teams can scale. Every new feature, workflow, or integration adds to the testing effort, making it harder to maintain coverage without increasing maintenance overhead. Adding more testers or creating more scripts doesn’t solve the underlying problem. It simply postpones it. 1.1 Why QA Teams Spend More Time Fixing Tests Than Writing Them Ask many QA teams where most of their time goes, and the answer is often surprising. Instead of expanding test coverage or improving quality processes, a significant portion of effort is spent maintaining existing test assets. As applications evolve, even small interface changes, updated workflows, or modified business logic can cause automated tests to become outdated and require manual intervention. This creates an ongoing maintenance cycle that limits the value teams get from automation. 
Rather than focusing on new features, risk-based testing, or improving release confidence, testers spend their time updating scripts, reviewing failures, and keeping existing suites aligned with the application. Over time, the effort required to maintain automation can grow faster than the test suite itself, making it difficult for teams to scale testing as products become more complex.

1.2 The Real Cost of Flaky Tests in CI/CD Pipelines

Flaky tests are more than just a technical nuisance. When tests fail inconsistently, teams lose confidence in the entire automation process. Developers become less willing to trust test results, failures are re-run repeatedly to confirm whether they are real, and genuine defects can be overlooked because they appear alongside unreliable test outcomes.

The impact extends beyond QA. Unstable tests slow down CI/CD pipelines, delay release decisions, and increase the amount of manual investigation required before changes can move forward. Instead of accelerating delivery, automation becomes another system that needs constant attention.

This is why modern QA teams increasingly focus not only on expanding automation coverage but also on reducing maintenance overhead, improving test reliability, and ensuring that test results remain a trustworthy source of feedback throughout the development lifecycle.

1.3 Why Traditional Automation Doesn’t Scale with Product Complexity

Script-based automation was designed for simpler software landscapes. It works reasonably well when interfaces are stable and workflows predictable. But modern applications change continuously. New features ship weekly, UI frameworks get upgraded, and integrations multiply. Traditional end-to-end automation responds by requiring more scripts, more maintenance, and more specialists. At some point, the cost of maintaining automation exceeds its value, and teams either abandon coverage or accept that their safety net has holes in it.

2. What Is AI End-to-End Testing (and Why It Changes Everything)?

AI end-to-end testing is more than traditional automation enhanced with AI features. It introduces a different approach to creating, maintaining, and managing tests across complex user journeys.

2.1 From Script-Based Tests to AI-Driven Workflows

Traditional end-to-end automation is highly dependent on manually created scripts, where testers define specific steps, selectors, and expected outcomes. While this approach can be effective, maintaining those scripts becomes increasingly difficult as applications evolve.

AI-assisted testing introduces a more flexible workflow. Instead of starting every test from scratch, teams can use requirements, tickets, release notes, and natural-language descriptions as inputs for creating test scenarios and supporting automation efforts. This helps reduce manual effort while keeping testers in control of validation and decision-making.

Rather than focusing solely on predefined interactions, AI can help teams maintain alignment between business requirements and testing activities as products grow in complexity. The result is a more scalable approach to end-to-end testing, where teams spend less time creating and maintaining test assets and more time focusing on quality outcomes.

2.2 How AI Understands User Flows Instead of Hardcoded Steps

Traditional automated tests are built around predefined actions and expected outcomes. While effective, these tests often require regular updates as applications evolve and user journeys change over time.

AI-assisted testing introduces additional context into the process. Instead of relying exclusively on manually created scripts, teams can use requirements, tickets, release notes, and other project documentation to help generate and organize test scenarios. This creates a stronger connection between business requirements and testing activities.
By supporting test creation and maintenance throughout the development lifecycle, AI helps teams keep pace with changing applications without relying entirely on manual updates. The result is a more scalable testing workflow that reduces administrative effort while keeping human oversight and validation at the center of the process. 3. Core AI Capabilities That Eliminate Testing Bottlenecks The value of AI in end-to-end testing goes beyond speed. Modern AI-powered platforms help teams reduce repetitive tasks, improve consistency, and keep testing activities aligned with rapidly changing applications. 3.1 AI-Assisted Test Creation One of the most practical applications of AI is helping teams create test cases faster. Instead of starting from a blank page, QA teams can use requirements, tickets, release notes, and other project documentation as inputs for generating draft test scenarios. This reduces manual effort while maintaining human review and validation throughout the process. 3.2 Smarter Regression Planning As test suites grow, deciding which tests should be executed becomes increasingly difficult. AI can support regression planning by helping teams identify the most relevant test suites based on the scope of change, allowing them to focus testing efforts where they are most valuable. For example, QATANA uses this approach by helping teams select regression suites based on ticket content and release information, reducing administrative overhead while supporting more efficient release cycles. 3.3 Reduced Maintenance Effort Maintaining test assets is often one of the most time-consuming parts of automated testing. AI-powered workflows can help teams keep test documentation, test cases, and automation assets aligned with evolving product requirements, reducing the amount of manual effort required to keep testing assets current. 
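The regression-planning idea from section 3.2 can be illustrated with a deliberately simple sketch. It ranks suites by how many of their tags appear in a ticket's text. The suite names, tags, and the `select_suites` function are hypothetical, and plain keyword overlap stands in for whatever model a real platform would use; this is not a description of how QATANA or any other product works.

```python
import re


def tokenize(text: str) -> set[str]:
    """Lowercase the text and return its set of alphanumeric word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def select_suites(ticket_text: str, suites: dict[str, set[str]],
                  min_overlap: int = 1) -> list[str]:
    """Rank suites by how many of their tags occur in the ticket text."""
    words = tokenize(ticket_text)
    scored = [(len(tags & words), name) for name, tags in suites.items()]
    # Highest overlap first; ties are broken by suite name for stable output.
    ranked = sorted(scored, key=lambda item: (-item[0], item[1]))
    return [name for score, name in ranked if score >= min_overlap]


# Hypothetical suite catalogue: suite name -> tags describing what it covers.
SUITES = {
    "checkout": {"cart", "payment", "checkout"},
    "onboarding": {"signup", "registration", "welcome"},
    "account": {"profile", "password", "account"},
}

ticket = "Fix payment rounding error in cart totals during checkout"
print(select_suites(ticket, SUITES))
```

A tester would still review the proposed selection before the run, which keeps the human validation step the article describes.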
3.4 Improved Visibility Across the Testing Lifecycle AI can also support better decision-making by helping teams organize testing information, identify relevant testing activities, and maintain visibility across both manual and automated workflows. When combined with reporting and traceability capabilities, this helps QA teams make more informed decisions throughout the release process. 4. How AI E2E Testing Tools Actually Work in Practice Understanding the benefits is useful, but the more important question is how these tools operate within a real development environment. 4.1 From Natural Language to Executable Tests The path from requirements to executable tests varies between platforms, but the overall goal remains the same: reducing the amount of manual effort required to create and maintain automation. Modern AI-powered testing tools can use requirements, tickets, release notes, and natural-language descriptions as inputs for generating draft test scenarios and supporting automation workflows. Some platforms generate executable test code directly in frameworks such as Playwright, while others focus on assisting teams with test creation, organization, and maintenance. Regardless of the approach, the objective is to help teams move from business intent to test execution faster while keeping validation and decision-making under human control. 4.2 Continuous Learning and CI/CD Integration Successful AI testing platforms do not operate in isolation. They integrate with the tools teams already use, including issue tracking systems, automation frameworks, and CI/CD pipelines. This allows AI-assisted testing activities to become part of existing development workflows rather than introducing a separate process. For example, QATANA follows this approach by integrating with Jira, Playwright, and CI/CD environments while providing a single view of both manual and automated testing activities. 
By bringing test management, automation, and reporting into one environment, teams can reduce fragmentation and improve visibility across the testing lifecycle. 5. What to Look for in an AI End-to-End Testing Platform Choosing the right AI end-to-end testing platform requires looking beyond marketing claims and focusing on the capabilities that deliver practical value in day-to-day QA work. 5.1 AI Should Reduce Maintenance, Not Create More Work One of the biggest challenges in test automation is maintaining test assets as applications evolve. AI-powered platforms should help teams reduce the effort associated with creating, updating, and organizing tests rather than introducing additional layers of complexity. Features such as AI-assisted test generation, support for maintaining test assets, and intelligent regression planning can significantly reduce administrative overhead. Just as importantly, AI should support human decision-making rather than replace it. The most effective platforms combine automation with clear review and validation workflows, ensuring that teams remain in control of what gets tested and how test results are interpreted. 5.2 Key Features That Reduce QA Overhead When evaluating AI testing platforms, focus on features that have a measurable impact on productivity and quality. These may include AI-assisted test creation from requirements or project documentation, intelligent regression suite selection, unified visibility across manual and automated testing activities, real-time reporting, and integrations with existing tools such as issue trackers, automation frameworks, and CI/CD pipelines. The most successful implementations are typically those that fit naturally into existing QA processes, helping teams spend less time maintaining testing assets and more time improving product quality. 6. Best Practices for Sustainable AI E2E Testing Deploying AI testing tools is only part of the journey. 
Long-term success depends on processes that keep testing reliable, maintain trust in results, and ensure AI supports rather than complicates QA workflows.

6.1 Focus on High-Value User Journeys

Start with the workflows that have the greatest impact on users and business outcomes. Critical paths such as onboarding, purchasing, account management, and key business processes should be prioritized before expanding automation coverage. Focusing on the areas with the highest risk and business value creates more sustainable results than attempting to automate everything at once.

6.2 Balance Speed with Reliability

Fast test execution has little value if teams cannot trust the results. AI-assisted testing should support reliable feedback by helping teams maintain consistent test assets, reduce unnecessary maintenance effort, and keep testing activities aligned with evolving application requirements. The goal is not simply to run more tests but to generate meaningful signals that support release decisions.

6.3 Keep Humans in Control

Human oversight remains essential. AI can accelerate test creation, support regression planning, and reduce repetitive work, but experienced QA professionals are still responsible for validating requirements, reviewing AI-generated outputs, and making quality decisions. The most successful teams use AI as a productivity tool rather than a replacement for human expertise.

6.4 Measure and Improve Continuously

AI testing should be treated as part of an ongoing quality engineering process. Monitoring test stability, maintenance effort, execution trends, and coverage over time helps teams identify opportunities for improvement while maintaining confidence in automation. Real-time dashboards and centralized reporting can provide the visibility needed to keep testing activities aligned with product quality goals.

7. How Qatana Approaches AI End-to-End Testing Differently

We built Qatana to help QA teams scale testing without proportionally increasing effort, headcount, or maintenance overhead. Rather than treating AI as a standalone feature, we designed it as a core part of the testing workflow, helping teams move from requirements to automation faster while maintaining full human oversight.

Unlike traditional test management platforms that primarily focus on storing and organizing test assets, Qatana helps teams generate draft test cases from requirements, tickets, and release notes. This creates a direct connection between business intent and testing activities, reducing the manual effort typically required to translate requirements into actionable test scenarios.

Qatana also helps teams streamline regression planning by identifying the most relevant test suites based on project changes. Instead of manually reviewing large repositories before every release, teams can focus their efforts on the tests that matter most while maintaining visibility across both manual and automated testing activities.

Built with modern QA workflows in mind, Qatana integrates with Jira, Playwright, and CI/CD environments, allowing teams to work within their existing delivery process rather than introducing additional tooling complexity. Built-in tutorials, intuitive navigation, and bulk import capabilities help reduce onboarding effort and accelerate adoption across QA teams.

For organizations operating in regulated or security-sensitive environments, Qatana offers additional advantages through on-premise deployment, audit-ready logs, role-based access controls, and support for enterprise governance requirements. Combined with AI-assisted test generation and unified test management, this allows teams to modernize QA processes while maintaining control, traceability, and compliance-oriented workflows.
The result is a platform that helps organizations reduce repetitive QA work, improve collaboration between manual and automation teams, and scale testing more efficiently as applications grow in complexity. If you’re exploring how AI can help your team scale end-to-end testing without increasing maintenance overhead, we’d be happy to show you how Qatana works in practice. Contact us to schedule a tailored demo and discuss your QA goals.

GPT-Powered AI Agents: How to Match Autonomy to the Process?

Until recently, enterprise automation followed a simple division: systems performed tasks defined by rules, while cases requiring interpretation were passed to people. GPT-powered AI agents expand the range of processes that can be supported through automation. They can work with documents, incomplete data and the language used by customers or employees, making them suitable for processes that were previously difficult to automate. For large organisations, this raises a practical question about AI agent autonomy: where does expert support end, and where does independent action within a process begin? In some situations, the agent’s role is to gather information and prepare a recommendation. In others, it prepares an action for approval. There are also areas where it can independently carry out repetitive steps when the organisation has defined the rules, permissions, limits and exception-handling paths. GPT-powered AI agents can already support teams with ticket handling, document analysis, decision preparation, data updates and multi-step tasks. The key implementation question is: which decisions and actions should remain with people, and which can an agent perform within agreed rules? An AI agent in the enterprise is a process participant, not just a chatbot In practice, a GPT-powered agent needs five elements: access to reliable sources of knowledge, a clearly defined business objective, tools and integrations with enterprise systems, permissions aligned with its role, rules that define the boundaries of its actions. A language model can interpret the content of a document, a customer message or an incident description effectively. It does not, however, replace a business process. Workflows, permissions, validations and decision history are what make an agent operate predictably, even when it handles hundreds or thousands of cases each month. Three levels of AI agent autonomy In a large organisation, it is worth designing agents across three levels. 
This allows autonomy to grow alongside process maturity and trust in the solution.

| Operating level | Agent’s role | Example tasks | Human role |
| --- | --- | --- | --- |
| Level 1: Advisory agent | Analyses information and prepares a recommendation. | Case summary, risk identification, proposed response, ticket prioritisation. | Makes the decision and carries out the action. |
| Level 2: Agent preparing an action for approval | Completes the next steps in a process, stopping before actions with significant consequences. | Creates an application, updates data, prepares a communication, submits an instruction for approval. | Reviews and approves specified steps. |
| Level 3: Agent performing tasks automatically | Independently carries out tasks in line with the process policy. | Case classification, status updates, sending standard information, creating a task in a system. | Handles exceptions, monitors quality and updates process rules. |

The level of autonomy does not need to apply to the entire agent. The same agent may independently classify tickets, prepare a response that requires approval and transfer unusual cases to an expert. In practice, an organisation therefore designs autonomy for individual decisions and actions, rather than choosing a single operating model for the whole solution.

What determines whether an AI agent can complete a task independently?

A useful starting point is to assess two factors: the impact of the action on the organisation and whether it can be reversed. The greater the business, legal, financial or reputational consequences of a decision, the more important human approval becomes.

| Nature of the action | Recommended model |
| --- | --- |
| Low impact, simple rules, easy to reverse | Automatic execution with a record in the process history. |
| Medium impact, data from several sources, possible exceptions | The agent prepares the action and an authorised person approves it. |
| High financial, legal or customer impact | The agent presents analysis, options and justification. The decision remains with a person. |
| Unclear rules, incomplete data or conflicting information | Automatic escalation to an expert, together with the context and collected data. |

This principle is particularly useful in organisations operating across multiple countries, with complex permission structures and a large number of systems. Just as important as the list of tasks is knowing what the agent must not do and when it should hand a case over to a person.

7 questions to ask before giving an AI agent permission to act

1. What action should the agent perform? Describe it specifically, for example: “create a service ticket”, “update contact details” or “prepare a response to a complaint”.
2. What data will it work with? Identify the sources, data owners, update frequency and access rules.
3. What business rules must it follow? These may include financial limits, contractual terms, SLA levels, compliance requirements or communication policies.
4. What exceptions should stop the process? The agent needs a clear escalation path for unusual or incomplete cases, or those requiring specialist assessment.
5. Can the action be reversed? The ease of correction affects the appropriate level of autonomy, the scope of testing and the need for additional approval.
6. Who is accountable for the decision? The process owner, approver and technical team should all have clearly assigned roles.
7. How will the organisation establish why the agent took a particular action? The case history should show the input data, rules, sources used, recommendation and process outcome.

This is why AI agent projects often begin with bringing the process itself into order. The organisation gains more than a new AI capability: it also gains better visibility of responsibilities, exceptions and how work actually flows.

Where can GPT-powered AI agents add value in a large enterprise?
Customer service and back-office teams An agent can read a customer message, identify its subject, retrieve data from a CRM or case-management system, prepare a response in line with company policy and route it to the appropriate queue. For standard cases, it can also update a status, create a task for the team or send the customer a confirmation. Full autonomy works well for low-risk actions, such as providing information about the status of a ticket. Complaints, individual commercial terms or cases requiring interpretation of a contract should be passed to an employee together with the agent’s analysis. Finance, procurement and document workflows An AI agent can read a document, check whether the data is complete, compare it with a purchase order and flag discrepancies that require clarification. It can also prepare a case summary, collect missing information and initiate the appropriate approval workflow. Decision thresholds are particularly important in this area. The agent can process a document automatically when it meets all conditions, while cases that exceed a defined amount, contain discrepancies or concern a new supplier can be submitted for approval. IT, administration and ticket management In an IT environment, an agent can classify tickets, create an incident summary, search for similar cases in the knowledge base, propose actions in line with a runbook and update the user on progress. In administrative processes, it can prepare an application, complete data in a form and remind the requester about missing documents. For actions involving configuration changes, access permissions or production systems, an approval-based model is advisable. The agent reduces the time needed to prepare a decision, while the administrator retains control over the change. 
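The decision thresholds described for finance and procurement above can be sketched as a small routing function. This is a minimal illustration under stated assumptions: the `Invoice` fields, the `route_invoice` name and the 5,000 limit are all hypothetical, and a real process would take its thresholds from the organisation's own approval policy.

```python
from dataclasses import dataclass


@dataclass
class Invoice:
    amount: float
    matches_purchase_order: bool
    supplier_is_new: bool


def route_invoice(invoice: Invoice,
                  approval_limit: float = 5000.0) -> tuple[str, list[str]]:
    """Return the routing decision and the reasons that triggered approval."""
    reasons = []
    if invoice.amount > approval_limit:  # hypothetical limit, set by policy
        reasons.append("amount above limit")
    if not invoice.matches_purchase_order:
        reasons.append("discrepancy with purchase order")
    if invoice.supplier_is_new:
        reasons.append("new supplier")
    # Any single trigger is enough to send the case to a person.
    decision = "needs_approval" if reasons else "auto_process"
    return decision, reasons


print(route_invoice(Invoice(1200.0, True, False)))
print(route_invoice(Invoice(9000.0, False, True)))
```

Returning the reasons alongside the decision gives the approver the context for the escalation and leaves a record for the case history.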
Sales and commercial information management

An agent can prepare a briefing before a meeting by bringing together information from the CRM, proposals, correspondence and notes, then highlighting open points and suggested next steps. After the meeting, it can create a summary, propose data updates and prepare tasks for the team.

These are extensions of scenarios already familiar from everyday work with generative AI. Read more about what the current generation of models helps teams achieve in our article: GPT-5.6 from OpenAI: capabilities and business applications.

Why does an AI agent need a workflow?

An AI agent can interpret information and suggest next steps, but the process should define the sequence of actions, required validations and the people responsible for approval. In a large organisation, this is what determines the repeatability and scalability of the solution.

A process automation platform can act as a control layer: it triggers a task, provides the agent with the necessary context, receives the result, records the history and routes the case to the next stage. The agent then becomes part of a controlled workflow rather than operating as a separate tool outside the core process.

This approach is relevant to document workflows, request handling, HR processes, procurement and administration. See how WEBCON BPS can support the digitalisation and control of business processes, and how TTMS delivers process automation.

Four forms of human oversight of an AI agent

Human-in-the-loop is a model of control embedded in the process, from reviewing recommendations to handling exceptions and making decisions with greater impact. In a mature solution, people can play several different roles.

- Approving an action when the agent has prepared a specific instruction, communication or system change.
- Selecting an option when the agent has presented several possible solutions and their consequences.
- Handling an exception when a case falls outside the agent’s rules, available data or permissions.
- Overseeing process quality by analysing errors, rejected recommendations, completion times and changing business needs.

The most effective implementations use all four forms. The team does not manually review every standard operation, yet retains full control over actions with greater significance and over the direction in which the process evolves.

It is also worth observing whether human approval genuinely improves process safety or simply moves a bottleneck elsewhere. If an approver nearly always accepts the agent’s proposals without changes and the cases are easy to reverse, the organisation can consider automating the selected step. If recommendations often require correction or the approver needs to return to source data, this indicates that the process rules, quality of knowledge or scope of the agent’s permissions need attention.

When can an AI agent act automatically?

Automation delivers the most value when a task is frequent, has a repeatable structure, relies on available data and leads to a clearly defined outcome. It is also important to ensure that execution can be verified and corrected when data or rules change.

Good candidates include ticket classification, routing requests to the appropriate queue, completing data from approved sources, creating standard tasks, updating statuses and sending communications based on approved templates.

Combining GPT models with an enterprise knowledge layer, integrations and security rules provides a significant advantage. This allows the solution to work with information available to a specific role, rather than with an unstructured collection of documents and conversations.

When should an AI agent primarily provide advice?
An advisory role is especially valuable in cases that require contextual assessment, interpretation of company policy, negotiation, an individual approach to a customer or decisions with significant financial and legal consequences. In these situations, the agent can gather facts, summarise documents, identify missing information, compare options and prepare the rationale for a recommendation. The person gains time for business judgement, while the decision remains grounded in the knowledge, experience and accountability appropriate to the role. This model is particularly useful for managers, compliance specialists, legal teams, strategic procurement, finance teams and teams responsible for key accounts.

FAQ

What is the difference between an AI agent and a chatbot?

A chatbot primarily responds to questions in a conversation. An AI agent can also use approved tools, retrieve information from enterprise systems, follow workflow rules and complete defined process steps. Its value comes from combining language understanding with access to business context, permissions and a controlled process.

Should every AI agent have human approval before taking action?

No. The appropriate level of oversight depends on the impact and reversibility of the action. Low-risk, repeatable activities such as categorising tickets or sending a standard confirmation can be automated under defined rules. Actions affecting customers, contracts, finances, compliance or production systems should usually include approval or escalation to an authorised person.

Can one AI agent operate at different levels of autonomy?

Yes. Autonomy should be designed for individual actions rather than assigned to an entire solution. The same agent may classify a request automatically, prepare a response for approval and escalate an unusual case to an expert. This makes it possible to automate safely without treating every task in the same way.

What information does an AI agent need to work reliably in an enterprise?

An agent needs access to reliable and current knowledge sources, a clearly defined objective, appropriate permissions and rules for handling exceptions. It should also receive only the context relevant to the task and role. Workflows, validations and an auditable history of actions help ensure that its output can be reviewed and used consistently.

How can a company start implementing GPT-powered AI agents?

Start with one clearly defined process step that has measurable volume, repeatable inputs and a known outcome. Set the boundaries of the agent’s permissions, test it with standard and exceptional cases, and measure the effect on process time, quality and escalations. Once the team has evidence that the solution works reliably, its scope and autonomy can be expanded gradually.
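The FAQ's point that autonomy is set per action rather than per agent can be sketched as a small policy function. This is an illustration only: the `Oversight` levels, the `Action` fields and the three example actions are hypothetical names chosen for this sketch, not part of any product or framework, and the rule itself is a simplification of "impact and reversibility".

```python
from dataclasses import dataclass
from enum import Enum


class Oversight(Enum):
    AUTOMATIC = "automatic"    # agent acts under defined rules
    APPROVAL = "approval"      # agent prepares, an authorised person approves
    ESCALATION = "escalation"  # agent hands the case to an expert


@dataclass(frozen=True)
class Action:
    name: str
    high_impact: bool      # touches customers, contracts, finances, compliance or production
    reversible: bool       # can the effect be undone easily?
    unusual: bool = False  # outside the standard cases the agent was tested on


def required_oversight(action: Action) -> Oversight:
    """Choose an oversight level for one action (illustrative rule, not a standard)."""
    if action.unusual:
        return Oversight.ESCALATION
    if action.high_impact or not action.reversible:
        return Oversight.APPROVAL
    return Oversight.AUTOMATIC


# Hypothetical actions performed by the same agent.
actions = [
    Action("categorise_ticket", high_impact=False, reversible=True),
    Action("send_contract_amendment", high_impact=True, reversible=False),
    Action("handle_unrecognised_request", high_impact=False, reversible=True, unusual=True),
]

for action in actions:
    print(f"{action.name} -> {required_oversight(action).value}")
```

Keeping the decision in one function means the thresholds can be reviewed and tightened or relaxed as the pilot produces evidence, without changing the agent itself.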

NIS2 Compliance Documentation: What Evidence Should Businesses Prepare?

NIS2 compliance cannot be demonstrated by a policy library alone. A regulator, auditor or management body may need to understand not only what an organisation intended to do, but also whether its cybersecurity measures were approved, implemented, tested and improved over time. That distinction makes evidence management a central part of NIS2 readiness. Policies describe the expected approach. Evidence shows that people followed it, controls operated, exceptions were governed and material weaknesses reached the right decision-makers.

Directive (EU) 2022/2555, known as NIS2, does not prescribe one universal folder of documents for every regulated entity. It establishes outcomes and minimum areas that essential and important entities must address through appropriate and proportionate technical, operational and organisational measures. The exact records expected from an entity depend on its risk profile, services, sector, size, national implementing law and, in some cases, sector-specific EU rules.

This guide explains how businesses can build practical NIS2 compliance documentation, what evidence may support Articles 20, 21 and 23 of the Directive, and how to organise a defensible evidence pack without creating unnecessary bureaucracy. It is designed as a documentation and assurance guide, not as another general implementation roadmap or audit checklist.

1. Does NIS2 require specific compliance documentation?

NIS2 does not contain a single exhaustive schedule titled “documents every entity must maintain”. Instead, it creates duties that are difficult to perform or demonstrate without reliable records.

Article 20 requires management bodies of essential and important entities to approve cybersecurity risk-management measures, oversee their implementation and follow relevant training. Article 21 requires appropriate and proportionate risk-management measures covering at least ten specified areas. Article 23 establishes staged reporting for significant incidents. Supervision provisions allow competent authorities to request information and access data, documents or other evidence needed for their tasks.

As a result, documentation should support three questions:

- What decision, process or control was required?
- Who approved, owned or performed it, and when?
- What evidence shows that it operated and produced the intended result?

National legislation may define additional documents, registration information, audit requirements, reporting forms or retention periods. Organisations operating in several Member States should therefore maintain a jurisdiction register instead of assuming that one evidence pack satisfies every national procedure.

Certain DNS, cloud, data-centre, managed service, managed security, online marketplace, online search, social networking and trust service providers are also subject to Commission Implementing Regulation (EU) 2024/2690. For those entities, the Regulation and ENISA’s supporting technical guidance provide more detailed requirements and examples. Other entities may use that material as a reference, but should not present it as automatically binding outside its legal scope.

2. Documentation, records and evidence: what is the difference?

These terms are often used interchangeably, but separating them improves assurance.
| Category | Purpose | Examples |
| --- | --- | --- |
| Governing documents | Define what the organisation expects and who is responsible | Policies, standards, procedures, governance charters and control descriptions |
| Operational records | Show that a process or control was performed | Access reviews, vulnerability tickets, backup logs, supplier assessments and training records |
| Decision evidence | Shows how risks, exceptions and priorities were considered | Management minutes, risk acceptance, investment approvals and escalation records |
| Effectiveness evidence | Shows whether measures work as intended | Test results, restoration exercises, metrics, audits and verified remediation |
| Regulatory records | Support registration, notification and supervisory engagement | Scope analysis, authority correspondence, incident reports and information requests |

A policy is not proof that the process operates. A screenshot is not necessarily reliable evidence if its source, date, scope and owner are unclear. A test report has limited value if no one owns the findings or verifies their closure.

Strong evidence connects design with operation. It should allow a reviewer to trace a requirement to a control, the control to its owner, the owner to operational records and any failure to a documented decision or corrective action.

3. Build a NIS2 evidence map before collecting files

Collecting everything creates cost, confusion and additional security risk. A better approach begins with an evidence map.

An evidence map links each applicable obligation to the entity’s controls and records. It can be maintained in a governance, risk and compliance platform or a controlled spreadsheet, provided ownership, versioning and access are appropriate.
| Evidence-map field | What to record |
| --- | --- |
| Legal or control reference | Applicable NIS2 article, national provision, implementing rule or internal control requirement |
| Expected outcome | The risk or service outcome the measure should achieve |
| Control description | How the organisation addresses the outcome |
| Owner and operator | Who is accountable and who performs the activity |
| Evidence source | System, repository or process producing the record |
| Frequency or trigger | Monthly, quarterly, annually, after change or after an incident |
| Reviewer | Who assesses completeness and effectiveness |
| Retention and protection | How long evidence is kept and how access and integrity are protected |
| Status and exceptions | Current result, open gaps, accepted risk and remediation |

The map should reflect the services and systems in scope. A generic template can accelerate the work, but it should not become the basis for unsupported declarations. Where a control does not apply, the organisation should record the reason rather than leaving an unexplained blank.

4. Scope and applicability records

An organisation cannot build credible NIS2 compliance documentation without first establishing which entities and services are covered. Scope evidence is especially important for corporate groups, cross-border operations and businesses whose activities cross several sectors.
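Returning briefly to the evidence map from section 3: one row of such a map could be modelled as a small record, with a check for the "unexplained blank" the guide warns about. This is a minimal sketch, assuming a plain Python representation; the class name, field names and sample values are hypothetical, and a GRC platform or controlled spreadsheet would serve the same purpose.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class EvidenceMapEntry:
    reference: str                 # legal or control reference
    expected_outcome: str
    control_description: str
    owner: str                     # who is accountable
    operator: str                  # who performs the activity
    evidence_source: str
    frequency_or_trigger: str
    reviewer: str
    retention_and_protection: str
    status: str                    # current result, open gaps, accepted risk
    not_applicable_reason: Optional[str] = None


def unexplained_blanks(entry: EvidenceMapEntry) -> List[str]:
    """List empty fields, unless the entry records why the control does not apply."""
    if entry.not_applicable_reason:
        return []
    return [
        f.name
        for f in fields(entry)
        if f.name != "not_applicable_reason"
        and not str(getattr(entry, f.name)).strip()
    ]


# Hypothetical row for a backup restoration control.
entry = EvidenceMapEntry(
    reference="NIS2 Art. 21(2)(c)",
    expected_outcome="Critical services can be restored within approved objectives",
    control_description="Quarterly restoration test of critical systems",
    owner="Head of IT Operations",
    operator="Infrastructure team",
    evidence_source="",
    frequency_or_trigger="Quarterly",
    reviewer="Security manager",
    retention_and_protection="",
    status="Open gap: last test report not located",
)

print(unexplained_blanks(entry))
```

Running the check across the whole map gives the coordinator a short list of rows that still need an evidence source or a retention rule before the map is relied on.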
A documented applicability file may include:

- a list of relevant legal entities and establishments;
- services and activities mapped to Annex I or Annex II of NIS2 and corresponding national provisions;
- employee and financial data used for size classification;
- analysis of partner and linked enterprises where relevant to SME calculations;
- size-independent rules and any specific designation decisions;
- jurisdiction and competent-authority mapping;
- interaction with sector-specific EU legislation, such as DORA;
- legal advice or internal interpretation supporting uncertain classifications;
- review triggers for acquisitions, new services, restructuring and legislative change.

The purpose is not to produce a long legal memorandum for every entity. It is to make the conclusion reproducible. A reviewer should be able to see what facts were considered, which version of the law was used and who approved the result.

Scope documentation should also identify the network and information systems supporting covered services. Legal entity boundaries do not always match technical boundaries. Shared identity platforms, cloud tenants, data centres or managed providers may support several companies and services, making dependency evidence important.

5. Governance and management-body evidence

Article 20 makes management involvement a substantive requirement. Evidence should show more than the presence of cybersecurity on an annual agenda.

5.1 Approval of cybersecurity risk-management measures

Approval evidence can include board or management-body minutes, resolutions, decision papers and approved policy sets. The record should identify what was approved, the scope of the decision, material risks, known limitations, required resources and the reporting mechanism used to oversee implementation.

Where a large package is approved, a controlled index can identify all included documents and their versions. This avoids uncertainty about whether a policy was actually part of the decision.
5.2 Oversight of implementation

Oversight records may include periodic dashboards, risk committee minutes, programme status reports, overdue-action escalations and decisions concerning residual risk. Reporting should enable informed challenge.

Useful indicators connect controls with service outcomes. Examples include the proportion of critical services covered by tested recovery plans, overdue remediation for critical vulnerabilities, privileged access awaiting review, critical suppliers without current assurance and high-risk audit findings past their agreed date.

Raw activity counts are weaker. The number of alerts processed or employees trained may be relevant, but it does not by itself show whether the organisation can protect and recover its services.

5.3 Management training

Training evidence should record the audience, date, subject matter, facilitator and completion. The content should help management understand its responsibilities, the entity’s threat and risk profile, significant-incident escalation, risk acceptance and oversight expectations.

An attendance list alone may not demonstrate that training was suitable. Agenda materials, learning objectives, exercises or confirmation of understanding provide stronger context.

5.4 Accountability and delegated responsibility

An organisation should maintain a current responsibility model. This can include governance terms of reference, role descriptions, RACI matrices, escalation paths and authority for risk acceptance.

Operational tasks may be delegated to security teams, technology owners or providers. Documentation should still show how the management body receives assurance and how material matters are escalated. Outsourcing a control does not outsource the regulated entity’s responsibility for managing its risk.

6. Risk-analysis and treatment evidence

Article 21 begins with policies on risk analysis and information-system security. A defensible evidence trail demonstrates that risk management affects decisions and investment.

Core documentation may include:

- the approved cybersecurity risk methodology;
- risk criteria, impact scales and likelihood definitions;
- a service, process, information and technology inventory;
- risk assessments and a current risk register;
- treatment plans with owners, resources and deadlines;
- risk acceptance and exception records;
- reassessment after material change or an incident;
- links between risks, controls, suppliers and continuity priorities.

The risk register should not be an isolated spreadsheet owned only by the security team. Material risks need accountable business owners and a route to management. Treatment records should make clear whether the organisation is reducing, avoiding, transferring or accepting the risk.

Exceptions require particular care. A patching exception, unsupported system or delayed access review should identify the affected service, reason, compensating measures, approver, expiry date and review. Open-ended exceptions weaken both security and evidence quality.

7. Evidence for the Article 21 risk-management areas

The following examples illustrate records that may support the ten minimum areas in Article 21. They are not a universal statutory checklist.
| Article 21 area | Examples of useful evidence |
| --- | --- |
| Risk analysis and system-security policies | Methodology, risk register, policy approvals, review history and exception records |
| Incident handling | Response plan, severity criteria, incident tickets, communication logs, exercise reports and lessons learned |
| Business continuity, backup and crisis management | Business impact analysis, recovery objectives, continuity plans, backup monitoring, restoration results and crisis exercises |
| Supply-chain security | Supplier inventory, risk tiering, due diligence, security clauses, assurance reports, monitoring and exit plans |
| Secure acquisition, development and maintenance | Security requirements, architecture reviews, secure-development records, change approvals, vulnerability tickets and patch evidence |
| Assessment of effectiveness | Control testing, penetration tests, audits, metrics, findings and verified remediation |
| Cyber hygiene and training | Baseline standards, update and configuration records, role-based training, simulations and follow-up actions |
| Cryptography and encryption | Cryptographic policy, approved standards, key and certificate inventories, rotation logs and exception decisions |
| Human resources security, access and assets | Screening where lawful, joiner-mover-leaver records, access reviews, privileged-account evidence and asset inventories |
| MFA and secure communications | Coverage reports, enrolment and recovery controls, exception records, authentication tests and emergency communication exercises |

Evidence must remain proportionate. A small important entity and a multinational essential entity may address the same legal area with different operating models and documentation depth. The key question is whether the record is sufficient to demonstrate the control in the context of the entity’s risk.

8. Incident-reporting documentation

Article 23 requires essential and important entities to notify significant incidents through a staged process. The Directive provides for an early warning without undue delay and within 24 hours of awareness, an incident notification within 72 hours, and generally a final report within one month of the incident notification. Intermediate or progress reports may also be required.

Incident evidence should support both response and the reporting decision. Useful records include:

- the time and source of initial detection;
- the point at which the organisation became aware of the incident;
- technical and business severity assessments;
- the significant-incident assessment and its approver;
- affected services, systems, users and other persons;
- suspected malicious or unlawful activity;
- indicators of compromise and cross-border implications where available;
- containment, mitigation and recovery actions;
- copies of regulatory submissions and acknowledgements;
- customer, contractual, data-protection and law-enforcement communications;
- decision logs showing what was known and unknown at each stage;
- root-cause findings, lessons learned and corrective actions.

8.1 Preserve a reporting timeline

The reporting clock can begin before a complete forensic conclusion is available. A reliable timeline is therefore essential. Systems should use synchronised time sources, and the incident lead should record material decisions as they occur.

The evidence should distinguish facts, assumptions and pending investigation. Early notifications can be qualified. A clear record of uncertainty is more credible than retrospective notes that imply the organisation knew everything at the start.

8.2 Document non-reporting decisions

Not every security event meets the threshold of a significant incident. When an event is assessed as non-reportable, the organisation should retain a proportionate record of the facts, criteria and decision. This helps demonstrate consistency and enables later reassessment if the impact changes.
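The staged Article 23 deadlines described in this section can be derived from two recorded timestamps, which is one reason the awareness time belongs in the timeline. The sketch below is illustrative only: the function names are hypothetical, the "same day next month" reading of "one month" is an assumption, and national law or authority guidance may set different or additional rules.

```python
import calendar
from datetime import datetime, timedelta, timezone
from typing import Dict, Optional


def add_one_month(moment: datetime) -> datetime:
    """Same day in the next month, clamped to that month's last day (assumed convention)."""
    year = moment.year + (1 if moment.month == 12 else 0)
    month = 1 if moment.month == 12 else moment.month + 1
    day = min(moment.day, calendar.monthrange(year, month)[1])
    return moment.replace(year=year, month=month, day=day)


def reporting_deadlines(
    aware_at: datetime, notification_sent_at: Optional[datetime] = None
) -> Dict[str, datetime]:
    """Latest times for each stage; 'without undue delay' may require acting sooner."""
    deadlines = {
        "early_warning": aware_at + timedelta(hours=24),
        "incident_notification": aware_at + timedelta(hours=72),
    }
    # The final report period runs from the incident notification, not from awareness.
    if notification_sent_at is not None:
        deadlines["final_report"] = add_one_month(notification_sent_at)
    return deadlines


# Hypothetical incident: awareness and notification times recorded in UTC.
aware = datetime(2026, 1, 30, 9, 15, tzinfo=timezone.utc)
notified = datetime(2026, 1, 31, 16, 0, tzinfo=timezone.utc)

for stage, due in reporting_deadlines(aware, notified).items():
    print(f"{stage}: {due.isoformat()}")
```

Using timezone-aware timestamps avoids ambiguity when the incident team, the systems and the competent authority sit in different time zones.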
National law, authority guidance and the Commission Implementing Regulation for specified digital and ICT entities may provide additional thresholds and procedural detail. Reporting templates and contact information should be maintained for every relevant jurisdiction.

9. Supply-chain and supplier evidence

Supplier documentation should show that the organisation understands which relationships could affect its covered services and applies scrutiny proportionate to the risk.

An evidence set may contain:

- a supplier inventory linked to services and information assets;
- inherent-risk and criticality classifications;
- due-diligence questionnaires and supporting documents;
- independent assurance reports and certifications, with scope and exceptions reviewed;
- security requirements in contracts and statements of work;
- incident-notification and cooperation provisions;
- subcontracting, location and concentration-risk information;
- access granted to supplier personnel and periodic access reviews;
- performance, vulnerability and incident monitoring;
- reassessment records following change or an incident;
- continuity, substitution and secure-exit plans.

A certificate should not be stored without analysis. Its scope, period, exclusions and relationship to the delivered service matter. Similarly, a completed questionnaire is a supplier statement, not independent proof. Higher-risk suppliers may require interviews, technical evidence, independent reports or contractual verification rights.

Documentation should also record the organisation’s response to deficiencies. Accepting a supplier risk without an owner, expiry date or compensating measure creates an unmanaged exception.

10. Business continuity, backup and recovery evidence

Continuity documentation should connect business priorities with technical recovery capability. The evidence chain may begin with business impact analysis and service dependency maps. These should support recovery time and recovery point objectives, response priorities, backup architecture, alternative procedures and supplier arrangements.

Operational evidence can include:

- current continuity, disaster-recovery and crisis-management plans;
- protected backup configuration and monitoring;
- restoration tests showing which data and systems were recovered;
- actual recovery duration compared with approved objectives;
- test limitations, failures and remediation;
- exercise attendance, decisions and lessons learned;
- emergency contacts and out-of-hours escalation checks;
- evidence that critical providers participated where relevant.

A successful backup job is not the same as a successful recovery. Evidence should demonstrate that required data can be restored into an operable service under plausible conditions.

Testing should vary scenarios. Tabletop exercises are useful for decisions and communication, while technical restoration tests provide evidence of recovery capability. More complex entities may use integrated exercises involving suppliers, facilities and business teams.

11. Security-control and technical evidence

Technical evidence is often abundant but difficult to interpret. The objective is not to export every log. It is to retain records that demonstrate scope, operation, review and response.

Examples include:

- approved secure-configuration baselines and compliance reports;
- vulnerability scan coverage and remediation tickets;
- patch status linked to criticality and exceptions;
- endpoint, network and cloud monitoring coverage;
- identity and privileged-access reviews;
- MFA coverage and bypass exceptions;
- encryption, key and certificate management records;
- change approvals and security testing;
- secure-development and dependency-scanning results;
- asset inventory completeness checks;
- alert investigations and response outcomes.

Tool screenshots should be used carefully.
Prefer repeatable reports or system exports with the source, timestamp, query scope and responsible reviewer recorded. Evidence should be protected from unauthorised modification, particularly when it may support an investigation.

12. Evidence that measures are effective

Article 21 includes policies and procedures to assess the effectiveness of cybersecurity risk-management measures. This means documentation should go beyond implementation status.

An effectiveness file can combine:

- defined control objectives and success criteria;
- control self-assessments;
- technical testing and independent review;
- security and resilience metrics;
- internal and external audit reports;
- incidents and near misses indicating control performance;
- trends and recurring weaknesses;
- corrective actions with owners and deadlines;
- proof that high-risk remediation was independently verified.

Metrics should be interpreted. For example, “98% of critical systems patched on time” requires a defined population, a reliable inventory, treatment of exceptions and information about the remaining 2%. A positive average can conceal exposure in a critical service.

Management reporting should distinguish control design, implementation and effectiveness. A control can be well designed but inconsistently operated, or widely deployed but ineffective against a realistic threat.

13. How to assemble a NIS2 evidence pack

An evidence pack is a controlled view of relevant records, not a permanent duplicate of every operational file.

1. Start with an index

The index should identify the requirement, document or evidence item, owner, version or period, source location, access classification and review status. It should also identify unavailable evidence and open remediation.

2. Use service-based navigation

Regulatory obligations apply to entities, but operational impact occurs through services. Organising evidence around covered services helps reviewers understand dependencies, risks, controls and recovery priorities.

3. Select representative periods and samples

Evidence should show operation over time. One access review performed immediately before an assessment does not demonstrate a mature quarterly process. Samples should cover the relevant period, locations and technologies.

4. Preserve source and context

Each item should make clear where it came from, who produced or approved it, the date, scope and meaning. Remove unexplained screenshots, unlabeled exports and drafts that could be mistaken for approved records.

5. Record gaps honestly

Do not create evidence retrospectively to imply that an activity occurred. Where evidence is missing, document the gap, immediate risk response, owner and remediation date. Transparent remediation is more defensible than an unreliable record.

6. Perform quality review

Legal, security, risk and service owners should check consistency. The asset inventory should agree with vulnerability coverage. Supplier classification should drive assurance. Recovery objectives should match test reports. Management minutes should reflect the material risks shown in dashboards.

14. Evidence quality principles

A practical evidence standard can be expressed through seven characteristics:

- relevant: it supports a defined requirement or control;
- authentic: its origin and ownership can be established;
- complete: it includes the scope and context needed for interpretation;
- accurate: it reflects what actually occurred;
- timely: it covers the required period and was produced at the appropriate time;
- protected: access, integrity and confidentiality are controlled;
- retrievable: authorised teams can find it when required.

These principles help teams decide whether a proposed record adds assurance or merely volume.

15. Retention, confidentiality and evidence security

NIS2 does not establish one universal retention period for every type of compliance record. Retention should be determined using national requirements, limitation periods, sector rules, contractual duties, audit cycles, incident-investigation needs and the organisation’s risk.

Evidence may contain sensitive architectural details, vulnerabilities, personal data, credentials, supplier information or legal advice. It should be classified and protected accordingly. Collecting material for an assessment does not justify placing unrestricted copies in a shared folder.

The organisation should define:

- approved repositories and access roles;
- version control and approval status;
- retention and defensible disposal;
- legal hold and investigation procedures;
- integrity protection and backup;
- secure transfer to auditors or authorities;
- handling of personal and privileged information;
- return or deletion of assessment copies.

Data minimisation matters. Evidence should be sufficient for its purpose without exposing unnecessary personal data, secrets or complete security configurations.

16. Common NIS2 documentation mistakes

16.1 Treating policies as proof of operation

Policies establish intent. They need corresponding reviews, logs, tests, decisions and corrective actions.

16.2 Collecting screenshots without context

A screenshot may not show the source, date, population, filters or reviewer. Use controlled exports and explanatory notes where possible.

16.3 Building the evidence pack only before an audit

Last-minute collection produces gaps and inconsistent records. Evidence generation should be embedded into normal control operation.

16.4 Keeping expired exceptions open

Exceptions should have owners, compensating measures and expiry dates. Repeated extensions require appropriate challenge and escalation.

16.5 Storing sensitive evidence too broadly

Centralisation improves retrieval but can create a valuable target. Use classification, least privilege, logging and secure transfer.
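For the integrity protection and secure transfer points raised in section 15, one widely used technique is to record a cryptographic hash for each evidence item, so that a later copy can be checked for modification. The following is a minimal sketch using SHA-256 from Python's standard library; the function names, item names and contents are hypothetical, and a real repository would hash files on disk and protect the manifest itself from tampering.

```python
import hashlib
from typing import Dict, List


def build_manifest(items: Dict[str, bytes]) -> Dict[str, str]:
    """Map each evidence item name to the SHA-256 digest of its content."""
    return {name: hashlib.sha256(content).hexdigest() for name, content in items.items()}


def changed_items(items: Dict[str, bytes], manifest: Dict[str, str]) -> List[str]:
    """Names in the manifest whose content is missing or no longer matches."""
    return sorted(
        name
        for name, digest in manifest.items()
        if name not in items or hashlib.sha256(items[name]).hexdigest() != digest
    )


# Hypothetical evidence items, held in memory to keep the sketch self-contained.
evidence = {
    "access-review-2026-Q1.csv": b"user,role,reviewed_by\nalice,admin,bob\n",
    "restore-test-2026-03.txt": b"Restored CRM database in 3h 10m\n",
}
manifest = build_manifest(evidence)

# Simulate a later, altered copy of one item.
received = dict(evidence)
received["restore-test-2026-03.txt"] = b"Restored CRM database in 1h 00m\n"

print(changed_items(received, manifest))
```

A hash only detects change; it does not show who changed an item or why, so it complements access control and logging rather than replacing them.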
16.7 Ignoring contradictory records

An approved policy may claim quarterly reviews while operational records show annual activity. Resolve discrepancies instead of presenting them as separate truths.

16.8 Equating certification with complete NIS2 evidence

ISO/IEC 27001 certification can provide useful governance and control records. It does not automatically demonstrate legal scope, national notification procedures or every NIS2 outcome. The certification scope and statement of applicability must be understood.

17. NIS2 compliance documentation checklist

Use this checklist as a planning aid. Adapt it to the entity, national law and risk profile.

[ ] Applicability and jurisdiction analysis is documented and approved.
[ ] Covered services, systems, data, people, facilities and suppliers are mapped.
[ ] Management approval of risk-management measures is traceable.
[ ] Management oversight and cybersecurity training records are current.
[ ] Roles, escalation paths and risk-acceptance authority are defined.
[ ] Risk methodology, assessments, register and treatment plans are maintained.
[ ] Policies are version-controlled, approved and linked to operating procedures.
[ ] Incident records preserve awareness, decisions, actions and reporting timelines.
[ ] Non-reporting decisions for material events use documented criteria.
[ ] Continuity and recovery documentation is linked to critical services.
[ ] Backup and restoration evidence demonstrates recoverability.
[ ] Supplier inventory, classification, due diligence and monitoring are current.
[ ] Security clauses and supplier-exit arrangements reflect criticality.
[ ] Vulnerability, patch, configuration and change records show control operation.
[ ] Access, privileged accounts and MFA exceptions are reviewed.
[ ] Cryptographic keys and certificates are governed and monitored.
[ ] Training evidence is role-based and includes effectiveness indicators.
[ ] Control testing and audits produce owned, time-bound remediation.
[ ] High-risk findings have verified evidence of closure.
[ ] Evidence retention, access, integrity and secure transfer are defined.
[ ] The evidence index identifies missing or outdated records.
[ ] Evidence is reviewed after major incidents, changes and regulatory updates.

18. How this guide fits with implementation and audit work

Documentation should emerge from real controls. Organisations that are still designing their programme can use TTMS’s practical guide to implementing NIS2 for a broader implementation perspective. For a general overview of business duties, see cybersecurity obligations of businesses under NIS2.

An implementation programme creates and operates controls. An evidence programme makes their ownership, decisions and results demonstrable. An audit or assessment then evaluates whether the measures and evidence satisfy the applicable criteria. These activities support one another but should not be confused.

19. Why TTMS?

Building a NIS2 evidence model requires an understanding of regulation, governance and the technology that generates operational records. TTMS can support organisations in mapping applicable requirements to services, controls, owners and evidence sources, then integrating those records into practical workflows.

Support may include evidence-readiness assessments, governance and responsibility design, control mapping, documentation frameworks, supplier assurance, incident and continuity exercises, technical-control verification and remediation planning.

The objective is not to create documents for their own sake. It is to help the organisation establish records that reflect working security measures and provide management with reliable assurance.

Engagement scope should be tailored to the entity’s legal position, national requirements, risk profile and existing management systems.
Legal conclusions should be confirmed by appropriately qualified advisers, while technical and organisational evidence should support those conclusions accurately. 20. Prepare a defensible NIS2 evidence pack Organisations should not wait for an authority request or audit notice before locating their records. Start with the services in scope, the decisions management must make and the controls protecting those services. Then identify which reliable records demonstrate operation and effectiveness. Contact TTMS to discuss a NIS2 documentation and evidence-readiness assessment tailored to your organisation. For authoritative background, consult the European Commission overview of the NIS2 Directive, ENISA’s NIS2 implementation resources and the official text of Directive (EU) 2022/2555 on EUR-Lex. 21. Frequently asked questions about NIS2 compliance documentation What documentation is required for NIS2 compliance? NIS2 does not prescribe one universal document pack. Covered entities need records sufficient to demonstrate management approval and oversight, appropriate and proportionate risk-management measures, significant-incident reporting and compliance with applicable national procedures. Typical evidence includes scope analysis, governance decisions, risk records, policies, operational control records, supplier assurance, continuity tests, incident files, metrics, audits and remediation. Is a policy enough to prove NIS2 compliance? No. A policy describes the intended approach. Evidence of operation may include approvals, system records, reviews, test results, incidents, exceptions and corrective actions. A reviewer should be able to connect the policy to actual controls and accountable owners. Does NIS2 require an information security management system? NIS2 requires a governed set of appropriate and proportionate cybersecurity risk-management measures. 
National law may expressly require an information security management system, and an ISMS is a practical way to organise policies, risk management, controls and improvement. Organisations should verify the terminology and detailed requirement in each relevant jurisdiction. Does ISO 27001 certification provide sufficient evidence? ISO/IEC 27001 certification can provide valuable evidence, but it is not automatic proof of complete NIS2 compliance. The certification scope, exclusions and statement of applicability matter. Legal scope, management duties, national registration and incident-reporting procedures still require specific assessment. How long should NIS2 evidence be retained? There is no single NIS2 retention period covering every record. The organisation should define retention using national law, sector obligations, audit cycles, limitation periods, contractual duties, investigation needs and risk. Sensitive evidence should be disposed of securely when retention is no longer justified. Should every security log be placed in the evidence pack? No. The pack should provide a controlled view of relevant evidence. Operational logs may remain in their source systems, with an index describing ownership, scope, retention and retrieval. Export only what is necessary and protect sensitive technical information. What evidence should the management body receive? Management should receive information enabling approval and effective oversight: material risks, measure implementation, significant incidents, control failures, critical supplier exposure, effectiveness results, overdue high-risk actions and decisions requiring acceptance or investment. How should incident-reporting decisions be documented? Record awareness time, affected services, severity and impact, applicable thresholds, known and unknown facts, the decision-maker and the basis for reporting or not reporting. Keep copies of notifications, acknowledgements and subsequent updates. 
Follow applicable national procedures.

What supplier evidence is useful for NIS2?

Useful records include supplier criticality, due diligence, assurance reports, contractual security provisions, access reviews, monitoring, incident cooperation, continuity arrangements and exit plans. Evidence depth should reflect the supplier’s access and potential impact on covered services.

How often should the NIS2 evidence pack be reviewed?

Set a risk-based schedule and update records through normal operations. Additional review should follow material incidents, acquisitions, major system or service changes, new critical suppliers, significant control failures and legal updates.

Who should own NIS2 compliance documentation?

Ownership is distributed. Legal or compliance teams may maintain the requirements map, while security, IT, service owners, procurement, HR and continuity teams own operational records. A central coordinator should manage the evidence index, quality checks and escalation without becoming the artificial owner of every control.
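
The FAQ answer on documenting incident-reporting decisions lists the fields worth recording. As a minimal sketch only, those fields could be captured in a small structured record with a completeness check. The class name, field names and example values below are hypothetical and are not prescribed by NIS2 or by any national procedure.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class IncidentReportingDecision:
    """Hypothetical record of a single report / do-not-report decision."""

    aware_at: datetime               # when the organisation became aware
    affected_services: list[str]
    severity_and_impact: str
    applicable_thresholds: list[str]
    known_facts: list[str]
    unknown_facts: list[str]         # may legitimately be empty
    decision_maker: str
    reported: bool                   # True if the incident was reported
    basis: str                       # reasoning for reporting or not reporting
    # References to notifications, acknowledgements and later updates.
    attachments: list[str] = field(default_factory=list)

    def missing_fields(self) -> list[str]:
        """Return the names of descriptive fields that were left empty."""
        required = {
            "affected_services": self.affected_services,
            "severity_and_impact": self.severity_and_impact,
            "applicable_thresholds": self.applicable_thresholds,
            "known_facts": self.known_facts,
            "decision_maker": self.decision_maker,
            "basis": self.basis,
        }
        return [name for name, value in required.items() if not value]


# Made-up example values, for illustration only.
decision = IncidentReportingDecision(
    aware_at=datetime(2026, 1, 15, 9, 30, tzinfo=timezone.utc),
    affected_services=["customer portal"],
    severity_and_impact="Portal unavailable for two hours",
    applicable_thresholds=["national significant-incident criteria"],
    known_facts=["outage start and end times"],
    unknown_facts=["root cause"],
    decision_maker="Head of Security",
    reported=True,
    basis="",  # deliberately empty to show the completeness check
)
print(decision.missing_fields())  # ['basis']
```

A check like this only confirms that the record is filled in. Whether the decision itself was correct remains a matter for the accountable decision-maker and the applicable national procedure.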

ChatGPT 5.6 in Practice: Initial Compliments and Disappointments

OpenAI rolled out GPT-5.6 in stages. It first appeared in limited test access for selected partners. Access to ChatGPT 5.6 reached Europe, including Poland, gradually, so only recently have teams been able to test the model in everyday work.

Expectations are high. In the second half of 2026, businesses expect language models to handle multi-step tasks and work with extensive context. Ease of use matters too. GPT’s interface has undergone a major redesign. Has it improved the user experience and the quality of responses?

This article explores that question, as well as:

- which business processes ChatGPT 5.6 can support by improving productivity and the quality of working materials,
- how to plan an AI pilot in your organisation, measure results and maintain quality control,
- which limitations of ChatGPT 5.6 to consider before a wider rollout,
- how to establish a shared standard for prompts and output validation across the team,
- what early users think about working with ChatGPT 5.6.

If you are looking for a full overview of the changes, pricing, models and capabilities of GPT-5.6, see our article GPT-5.6 from OpenAI: what has changed, pricing, capabilities and business applications.

ChatGPT 5.6: our first impressions and early industry feedback

Early expert reviews focus primarily on context handling. Reviewers note that when working with substantial material that goes through multiple rounds of edits, ChatGPT 5.6 is better at keeping the task on track. Most of us have experienced earlier OpenAI models losing their bearings. On top of that, the model itself encouraged endless revisions, which could pull the material away from the original intent of the prompt.

GPT 5.5 had an irritating habit of suggesting more and more variations.
Almost every response ended with a clickbait-style suggestion along the lines of: “If you want, I can help you add two elements that will create a wow effect and give the text around 50% more SEO power.” As a result, instead of closing the topic, we were drawn into the model’s endless doubts: could the material really not be improved further? GPT 5.6 is no less capable than the older model, but it finally respects what matters most: the intent behind the prompt and our time.

Kajetan Terlecki, SEO Specialist, TTMS

Another recurring observation concerns the quality of the first draft, meaning the material GPT produces after the first prompt. Reviewers emphasise that the model’s draft is usually well structured and much closer to a final version than it was with GPT-5.5. It is not a perfect ten yet, but a solid eight. In other words, a final version may be within reach after a relatively short time. With earlier GPT models, the “brainstorming” phase took much longer.

The third area, and the most immediately noticeable one, is the way we use the tool, which we can simply call the “interface”. It is admittedly quite complex. Beyond writing a prompt, users must make a series of decisions:

- which workspace should I choose: Chat or Work?
- which model best fits my request: Luna, Terra or the most advanced Sol? Or is the older GPT-5.5 enough?
- does the task require Deep Research?
- how much effort should the model put into the task: low, medium, high, very high, max or ultra?
- should I use Turbo mode and generate a response 50% faster at the cost of higher token use?

If we add the almost endless range of available plugins, writing the prompt turns out to be only half the work required to get a useful result. I would welcome an automatic mechanism that reads the prompt and selects the right settings on its own. One that uses a sufficiently capable GPT model without wasting tokens when they are not needed. How do you navigate all this?
We have outlined a suggested configuration here, including which modes to use for different types of tasks.

Where does GPT-5.6 outperform the previous version?

1. GPT-5.6 is better at preserving document layout and formatting

The previous version of GPT had something of a goldfish memory. You could also compare it to a short blanket: pull it over one part, and another is left exposed. When we asked the model to update data in a document it had generated, it produced a factually correct response, but one that no longer followed the original format. It might use a different heading hierarchy, rearrange the information or omit elements that are essential for the company.

GPT-5.6 is much better at preserving the structure of reference material. OpenAI illustrated the difference in materials introducing GPT-5.6. The company placed three slides side by side: the reference file, the GPT-5.5 output and the GPT-5.6 output. The task was to update figures in a presentation while retaining the original template. In the comparison, GPT-5.5 omitted some template elements, while GPT-5.6 preserved the slide structure more faithfully: layout, typography, spacing, colours and recurring template elements.

OpenAI states that GPT-5.6 can also interpret rules saved in the slide template, including the Slide Master. In practice, this matters when a presentation needs to retain not only its colours and fonts, but also defined layouts, spacing and mandatory components.

2. GPT-5.6 moves beyond the chat window

GPT-5.6 shows its greatest potential when it works not only with a single instruction, but also with files and tools made available by the user. It can then move quickly through a task: from gathering the materials to preparing a first draft. The new GPT model can identify related files in a project folder, flag places that need updating and prepare working versions of documents. There is a catch: the process still needs human oversight.
Someone must check whether GPT found all the relevant files, understood the context correctly and left unchanged the elements that were meant to remain unchanged. Still, instead of manually digging through documents, the team starts with a list prepared by the model.

3. From an idea to a version you can show the team

Experts testing GPT-5.6 point out that the first version of a simple application, dashboard or website is now more often suitable for showing to a team and collecting specific feedback. It is somewhat like an MVP: good enough to test an idea, present it to the team and gather initial comments. A product owner can see the whole process, a designer can assess the layout and usability, and a developer can spot technical constraints sooner.

This does not mean that GPT-5.6 creates a finished product. The initial prototype still needs to be assessed for security, quality and architecture. The difference is concrete, however: the team can evaluate an actual solution earlier, rather than debating assumptions alone.

4. GPT-5.6 and “I don’t know”: is this the end of answers given for the sake of answering?

We all know the old classified ad: “Encyclopaedia Britannica, 40 volumes for sale. I got married a week ago, so I no longer need it. My wife knows everything better.” The know-it-all syndrome is a nuisance not only in old marriage jokes, but also for people who work with language models every day.

GPT often lacks the information needed to give a reliable answer. GPT-5.5, like earlier versions, would rather provide an incorrect yet convincing-sounding answer than admit it did not know. What about the new version? The change is visible at first glance: it is hard to capture in a benchmark but easy to appreciate in day-to-day work. Our first days of working with the two most advanced models, Terra and Sol, suggest that GPT-5.6 is more likely to say “I don’t know”, “I don’t have enough data” or “I could not find anything else on this topic”.
People still need to add or verify information manually, but this reduces the risk of an embarrassing error in material prepared for a client, the board or a project team.

Before you give GPT-5.6 an important task: what to watch out for in early testing

1. A working prototype is not yet a finished product

GPT-5.6 can prepare a website, dashboard or simple application that can be launched and shown to the team. This is a major step forward, particularly when testing an idea. The tests also reveal the other side: elements can become misaligned, interactions do not always work as intended, and visual details still require refinement. The first version can be an excellent starting point, but it should not automatically be sent to clients or other external audiences. Before treating it as finished, we need testing, a security assessment and, in some cases, a developer’s review.

2. The new Work environment can still be frustrating

Model quality is one thing. The way we use it in practice is another. One reviewer pointed out that, in Work, it was difficult to access generated files and open a preview of the finished result. Others criticised the number of settings, discussed earlier in this article, as well as the unclear distinction between Chat, Work and Codex. GPT-5.6 may complete a task correctly, while the working environment still makes it difficult to retrieve or review the result. It is worth testing the entire process, not only the quality of the response in the chat window.

3. GPT needs clear boundaries

One reviewer tested how GPT-5.6 would handle a complex mathematical problem. The model produced correct parts of the solution, but surrounded them with definitions, digressions and comments that added little value. Only after the instruction was made more specific did it produce a useful result. The same applies in a business context. We should not leave the model too much room for interpretation.
It is better to state the expected result directly: “Prepare a one-page summary. Include the decision, three arguments, risks, missing information and next steps.” GPT then has fewer opportunities to pad the topic with peripheral content.

4. GPT can still be wrong

The fact that GPT-5.6 appears more likely to signal that it lacks data or a basis for drawing a conclusion does not mean it is free from hallucinations. Luna, Terra and Sol (with Sol seemingly the least prone to this) can still provide an incorrect date, number, source or conclusion without batting an eyelid. The rule to “check after AI” still applies and will likely remain relevant for many future GPT releases.

5. Start with one problem, not a large system

Once GPT-5.6 has access to files, a browser and company tools, it is easy to imagine a system that instantly organises the inbox, analyses team communication, updates the CRM and writes responses to clients. This vision can quickly turn into a project larger than the problem it was meant to solve.

One expert working with an extensive Codex environment recommends starting with a single, repeatable task. It might be preparing a meeting summary, gathering open project issues or updating an offer after data changes. Only once the team sees measurable results and understands the tool’s limitations is it worth adding further automations.

How should you run your first ChatGPT 5.6 test in the company?

A pilot should answer one straightforward question: does GPT-5.6 genuinely improve a selected stage of work, and does the benefit justify the time, cost and additional quality control?

The first test should not begin with building an extensive automation system. It is better to choose one repeatable task that currently takes up the team’s time and has a clearly defined outcome. This might be a meeting summary, a brief or a status report.
What matters is that the team knows which materials it provides to the model, what result it expects and who reviews the final document.

Before starting the pilot, work through five steps:

- Choose one process: for example, preparing meeting summaries, sales briefs or materials for project decisions.
- Set a baseline: measure the time needed to prepare the material, the number of revisions, the number of people involved and the most common errors.
- Prepare a shared prompt: use the same input materials and clearly describe the outcome the team expects.
- Assign expert review: nominate a person who will verify the facts, assess quality and approve the result before it is used further.
- Assess the outcome: compare time, the number of iterations, completeness of the material and the usefulness of the result for the next stage of the process.

Pilot element | Question for the team
Process | Which stage of work do we want to shorten or organise?
Outcome | What should be produced: a brief, decision list, analysis, recommendation or communication draft?
Data | Which materials are needed, and can they be used in the selected AI environment?
Quality control | Who confirms the facts, completeness and alignment of the material with the process?
Metric | How will we compare working time, the number of revisions and the usefulness of the result?

After a few attempts, it becomes easier to assess whether the model is genuinely helping. Compare the time needed to prepare the material, the number of revisions and the effort required to verify the result. Only then decide whether to extend the pilot to further tasks.

Three processes worth starting with

1. Summaries after client meetings

The model can organise notes, gather decisions, identify open questions and prepare a list of next steps. The team confirms the arrangements and assigns task owners. This helps them move from discussion to action more quickly.

2. A brief for a sales conversation

Based on selected sales materials, previous arrangements and public information about the company, GPT-5.6 can prepare a brief, discovery questions and a list of topics that require clarification. The salesperson remains responsible for the client relationship and decisions regarding the offer.

3. A status report for the project team

The model can organise information about progress, blockers, risks and planned actions. The project owner confirms that the information is up to date before the report is shared further. This reduces the time the team spends manually consolidating data from several sources.

How do you embed AI in a business process?

After the pilot, it becomes clear whether ChatGPT 5.6 genuinely shortens the preparation of materials, reduces the number of revisions and helps the team move more quickly to the next stage of work. It also reveals where the model needs a better brief, access to data or expert oversight.

Proven use cases can then be extended to other processes. At this stage, it is worth addressing data security, integration with existing tools, output quality and a clear division of responsibilities. These factors determine whether AI becomes lasting support for the organisation.

At TTMS, we help organisations identify processes where automation and AI create business value. We then design solutions tailored to their data, regulatory requirements and ways of working. We combine engineering experience with a responsible approach to AI governance, confirmed by ISO/IEC 42001 certification.

Let’s discuss the processes AI could support in your organisation.

FAQ

How do you choose a process for your first ChatGPT 5.6 test?

The best candidate is a repeatable process that requires gathering several pieces of information and producing a predictable result. Examples include meeting summaries, sales briefs, status reports and document analysis.
The team should know the current turnaround time and typical issues, as these provide the baseline for assessing the test. Start with one process and expand the use of AI only after evaluating the outcome.

How do you measure the business value of ChatGPT 5.6?

During a pilot, measure the time needed to prepare the first version of the material, the number of revisions before approval, the completeness of the output and the expert time required for verification. It is also useful to track metrics related to the next stage of the process, for example faster meeting preparation, a shorter time to close agreed actions or fewer missing details in a report. This data helps assess team productivity based on actual results and supports decisions about integrating AI into further processes.

What data should you prepare for working with ChatGPT 5.6?

The model produces better results when the team provides current, well-organised source materials. Before starting, identify which documents take priority, which data must remain unchanged and how unverified information should be marked. The organisation should also define which data can be shared in the chosen AI environment. For personal, financial and confidential data, access rules, retention and compliance are essential.

How do you maintain human oversight of the model’s work?

Human oversight should be part of the process from the start. The process owner defines the task scope, an expert verifies facts and alignment with requirements, and an authorised person approves external actions. This division of responsibilities is particularly important for client communication, publications, data changes in systems and materials with legal or financial implications. It allows the team to use automation while retaining responsibility for the outcome.

Where can I find information about GPT-5.6 pricing, models and capabilities?
We have covered the changes in GPT-5.6, pricing, the Sol, Terra and Luna models, and business applications in a separate article: GPT-5.6 from OpenAI: what has changed, pricing, capabilities and business applications. This article focuses on the practical use of ChatGPT 5.6 in team workflows, early user experiences and how to run an AI pilot in an organisation.
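
The baseline-versus-pilot comparison recommended in this article can be kept very simple. The sketch below is one possible way to do it, assuming the team logs three numbers per task: preparation time, revisions before approval and expert review time. The class, the function names and all figures are made up for illustration, and the calculation assumes non-zero baseline averages.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class TaskRun:
    """One completed task. All field names are illustrative."""

    prep_minutes: float    # time to prepare the first version
    revisions: int         # revisions before approval
    review_minutes: float  # expert time spent on verification


def summarise(runs: list[TaskRun]) -> dict[str, float]:
    """Average each metric across a set of runs."""
    return {
        "prep_minutes": mean(r.prep_minutes for r in runs),
        "revisions": mean(r.revisions for r in runs),
        "review_minutes": mean(r.review_minutes for r in runs),
    }


def relative_change(baseline: list[TaskRun], pilot: list[TaskRun]) -> dict[str, float]:
    """Percentage change of each pilot average against the baseline.

    Negative values mean the pilot average is lower than the baseline.
    """
    base, new = summarise(baseline), summarise(pilot)
    return {key: round(100 * (new[key] - base[key]) / base[key], 1) for key in base}


# Made-up measurements: two tasks done the old way, two done with the model.
baseline = [TaskRun(60, 3, 10), TaskRun(40, 3, 10)]
pilot = [TaskRun(25, 2, 15), TaskRun(35, 1, 15)]

print(relative_change(baseline, pilot))
# {'prep_minutes': -40.0, 'revisions': -50.0, 'review_minutes': 50.0}
```

In this invented example, preparation time and revisions fall while review time rises, which is exactly the trade-off the pilot is meant to expose before the team decides whether to extend it.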

Wiktor Janicki
