1 of 100

PRODUCT REFERENCE

Introduction

Introducing CAP: The Composable Agentic Platform

TomorrowX CAP: The Future of Data-Driven Innovation

In the AI era, data is not just an asset - it’s a traveller, navigating a complex network of systems, transformations and decisions. At TomorrowX, we believe that mastering this journey is the key to unlocking the true potential of digital transformation and AI.

A New Perspective: Data as a Traveller

Imagine a data packet embarking on a journey through a modern network—much like a traveller passing through international airports, collecting stickers at every checkpoint. Each sticker represents a critical interaction—security checks, transformations, processing steps—defining the story of how data evolves.

Traditional systems obscure this journey, making it nearly impossible to track data flows, debug application errors, or ensure compliance. CAP changes that. By embedding lightweight agentic components across networks, CAP provides:

🔹 Fine-Grained Control – Gain full visibility and governance over data interactions across AI, cloud and enterprise systems. 🔹 Effortless Scalability – Modify, replace, or enhance capabilities without disrupting existing systems. 🔹 AI-Ready Data – Ensure machine learning models receive structured, high-quality and compliant data. 🔹 Real-Time Insight & Trust – Monitor and validate AI outputs by tracing data’s journey end-to-end.

Beyond Traditional Development: The Power of CAP

🚀 Composable Components – Break applications into modular, reusable building blocks, ensuring flexibility and resilience. 🔍 Secure, Transparent and Sovereign Data Flow – Understand where data originates, how it transforms and where it’s stored. 🛠️ AI-Optimised Orchestration – Enable rapid, compliant, and efficient deployment of AI-powered applications. 🌍 Compliance & Governance – Enforce policies and security protocols across the entire data lifecycle.

Unlike low-code/no-code platforms, CAP doesn’t limit flexibility. Instead, it enhances engineering freedom, ensuring that every component fits the application’s exact needs while maintaining full control over data governance.

Unlike monolithic architectures, CAP allows instant adaptability - expanding or refining functionalities without costly rewrites or system failures.

Mastering the AI Era with TomorrowX

At TomorrowX, we see data not just as an input—but as a dynamic force shaping outcomes, decisions and innovation. In AI, data’s journey matters as much as its destination.

With CAP, organisations gain control over this journey, ensuring AI investments are trustworthy, scalable and secure. By enabling real-time data visibility, compliance and orchestration, TomorrowX empowers businesses to move faster, solve problems smarter and lead in the AI era.

Experience Tomorrow. Get started with CAP today.

Composable Agentic Platform is continuously developed. As a result, some screen captures may differ from the current product version. This will only be the case where educational value derived from the image is not affected by the difference encountered.

Concepts and Terminology

There are a number of terms used in the Composable Agentic Platform which the user should be familiar with. This section introduces these terms and the concepts they represent.

Console

The console is used to administer the system. It is a browser-based application that provides the user with a complete view of all installed servers, rules, configurations and so on. The following is a sample view of the console after the user has logged on.

The console has four distinct panes as described below:

Top Banner

The top banner contains simply the logo and the log out button. This banner is visible on all console pages to facilitate easy log out from the product.

Administration tree

The administration tree is used to navigate the console application. To view the properties for a particular function, click the icon or folder in the tree in front of that function. To get back to the main login page, click on Console at the root of the tree. Increase or decrease the vertical space taken up by the tree by dragging the divider left or right.

Action window

Whenever a function is selected in the administration tree, a corresponding page is shown in the Action window.

Server console viewer

When a server is processing data, it has the ability to output information to an internal console. The console for each server is replicated in real time to the administration console and can be viewed in the pane at the bottom.

Any error messages from the Composable Agentic Platform server will also be visible here. Toggle the visibility of the Server console viewer by clicking the Hide Console or Show Console tab. Enlarge or reduce the size of space taken up by the viewer by dragging the bar across the top up or down.

Servers (X Agents)

Servers or Composable Agentic Platform servers are each instance of the X Agent or a database server. There are six types of servers: test, production, Multi-Protocol, database, template, and production with forwarder.

Test servers

Test servers are used to validate a rule set before it is deployed into production. Test servers can take and process a data file containing test data extracted from a production server or manually created.

Afterwards, performance data from the test server can be used to validate that a particular rule set performed as planned.

Production servers

Production servers are capable of taking a data feed from any number of sources, such as a Web Service, a HTTP request or a data file placed in a folder. Production servers continue running once started and are continuously waiting for data.

Multi-Protocol servers

Multi-Protocol servers are servers that are capable of intercepting one or more standard network protocols and breaking them down for analysis by the Composable Agentic Platform.

They differ from Production servers in that they can take data directly from the network layer without interpretation and can then pass that data through a set of protocol rules and proxy that same (or altered) data to a protected server.

The response from the protected server follows a similar path (with retained visibility of the original request).

Database servers

Database servers do not have an X Agent installed and do not show up as Composable Agentic Platform Servers as such. They can however be defined as servers so that JDBC connections to them can be specified.

Template servers

Template servers are used as "parents" for other servers. This allows settings such as web proxy settings and mail server settings to be inherited by other servers and reduces the maintenance load for large clusters.

Production servers with forwarders

This refers to a group of production servers that have a built in forwarding proxy controlled by Composable Agentic Platform. These servers also have the capability to enable a browser proxy so that the forwarders can be used for testing rules against applications.

Hardware servers versus Composable Agentic Platform servers

It is important not to confuse a Composable Agentic Platform server instance with a hardware server instance. A single hardware server can run many Composable Agentic Platform servers at once.

Projects

Projects are collections of tasks that needs to be completed to fully create a solution. Tasks are assigned to users and are listed as To Do items for that user. All tasks result in a work output of some kind.

By using predefined project definitions, it is easy to set up and track all the basic tasks required to complete a full enterprise deployment (such as rule writing, firewall rules, SSL certificate ordering and so on).

Configurations

Configurations hold the additional information a server needs to be able to execute a rule set. For example, if a server is reading from a CSV file, the configuration will define which column in the CSV file is stored in which variable, before being processed by the rule set.

Input formats, initial variable values, global variables, database information etc., are also all held in configurations.

Rules Editor

The rules editor is a graphical design tool for creating and maintaining rule sets. The rules editor is launched as a separate browser window from within the console application.

Rules, rule catalogue, protocol rules and rule sets

This document refers to rules, the rule catalogue and rule sets throughout. It is important to understand the distinction between them to avoid confusion.

Rule

A rule is a fundamental building block created by a Java programmer. Rules can be dynamically added to the system by downloading them from the TomorrowX website or other sources. In-house programmers can also add them.

Analysts generally do not create rules; they use them to build rule sets.

Examples of rules are: If Condition, History Recorder and Name Splitter.

Rule catalogue

In the rules editor, all of the rules installed are represented in the rule catalogue. It is a tree of all the rules that are available for an analyst to use. Rules are grouped into functional areas.

Protocol rules

There is a different type of rule used to break down elements of a given network protocol. Protocol rules can only be managed by an Administrator. Protocol rules use a different rule catalogue than the standard rules.

A protocol rule is specified against any new supported protocol as well as against the response supplied from a proxied protocol request.

Rule sets

Rule sets are combinations of rules put together to perform a given task or strategy. They are created using the rules editor and are typically a combination of rules and other rule sets. Rule sets created by analysts show up in the rule catalogue in the rule sets section.

When a rule set is completed, it is considered a rule by itself.

Rule set mode

A rule set mode is the rule set being executed when a data set hits the X Agent. By default, there is only one mode, but additional ones can be created to deal with specific situations (such as stand-in, promotion etc.). Modes can be changed without redeploying rules.

Test Data

Test data can either be in the form of manually created CSV, XLS or XML files, or it can be files captured during the execution of a rule set on a server (TST file).

All production servers allow the download of a set of test data at any time to use for further development of rules on a test server.

Trace Data

Trace data is captured during the execution of a rule set on a server (TRC files). They contain detailed information of the flow of data through a rule set. All servers allow a trace to start at any time for use in debugging rules.

Flight Recorders

Flight recorders are databases containing log records of specific activity. A flight recorder starts recording once a trigger event has happened and will then record for a specific number of records, until a time limit is reached, forever, or until it is manually closed. This can be used to track specific user activity.

Data from flight recorders can be converted to test data at any time.

Case Managers

Case Managers are databases and workflows that contain cases to be investigated and followed up by fraud or security experts. Cases can be inserted into the case manager either manually or through rules. Alongside each case are tasks that need to be completed and task queues that facilitate the workflow.

Tasks

Tasks are connected to cases in the case manager. Each case can have one or more tasks that need to be completed. Once a task is completed, the action of completing that task may result in the case changing status (for example from “Open” to “Closed”) or yet another task being generated for that case.

Queues

Tasks are always contained in a queue. Queues are priorities and when a user picks the next task to perform, the queues assigned to that user will be checked consecutively for the next most urgent task.

Data Files

Data files are files used to assist the X Agent in making decisions.

Examples include CSV files of undesirable individuals, IP geo location databases and blacklists.

Content Files

Content files are a structure of files being served up by the X Agent in a manner similar to an HTTP server. The structure mirrors the structure of the files in the protected server and can be used to overlay images and pages on top of an existing application.

Performance Data

Once a rule set has been executed on a server, performance data can be retrieved (in graphical form) to see how rules are performing.

Extensions

The Composable Agentic Platform X Agent is highly extensible. New rules are developed regularly. To allow use of them within the rules editor, an extension may need to be installed (manually or via the update server). This will add the new rules to the rules catalogue.

Protocols

Protocols are network specific, low-level interpretations of the data flowing over a network. Examples of protocols include HTTP, SMTP and ISO8583. Composable Agentic Platform has a special X Agent (accessible via administrator privileges) that allows for the definition of protocols.

Credential Vault

The credential vault provides a central location (only accessible to administrators) for storing user IDs, passwords, access codes and other data that is required to access external services. The information stored in the vault is automatically transferred to the X Agent upon deployment, eliminating the need for rule set writers to know any credentials.

Custom Functions

Custom functions are features that can be added to the Composable Agentic Platform console using rules. These same custom functions are subject to the normal Composable Agentic Platform console roles configuration and can be used to add simple maintenance tasks (such as blacklists) directly to the console so that they are accessible to normal console users without needing any other applications.

Databases

Databases are another fundamental concept of Composable Agentic Platform. In Composable Agentic Platform terminology, a database is anything that can be connected to via a JDBC driver.

JDBC drivers are connectivity modules for the Java language.

The term database refers to a system that is capable of storing data in a structured relational manner.

Alongside databases, there are a couple of terms that are important, which are listed below (Tables through Keys).

Tables

Tables are the name of the individual file within the database where a given set of data is stored. Sample table names are CUSTOMERS and ACCOUNTS. Some Composable Agentic Platform rules have the ability to create tables or read/write data to them.

Schema

A schema is a way to segment a database into multiple entities i.e. there can be two schemas on the same database containing the same table names. For example, there can be a PRODUCTION schema that contains an ACCOUNTS table as well as a TEST schema that contains an ACCOUNTS table.

Rows and columns

Rows and columns refer to the individual elements inside a table. For example, like a spreadsheet where all of the columns have a name.

Keys

Typically, all tables have keys that map to one or more columns in the table. In most cases tables have unique keys, which is enforced at the database level.

Input Adaptors

Input adaptors define how the X Agent interprets data that is sent to it. A large number of different adaptors such as XML, CSV, TST, HTTP and other data formats are shipped out of the box. New adaptors can be added, if required.

Users

Users, in the context of Composable Agentic Platform, are the users signing in to the console.

User Roles

Not all users signing in to the console will have the same roles. It is possible for all non-administrative users to have their access restricted to specific console functions.

Access Rules

Access rules can be used to define conditions of user access. Access rules can, for example, restrict a user to only be able to sign in from the local physical system, require a second-factor logon or call out to a single sign-in system for password validation. Access rules are plug-ins and the system can be extended with new rules on demand.

Repositories

Repositories are collections of configurations, rule sets and data. They work similar to folders in a normal file system to logically separate and control access to important assets. A typical set of repositories for any given system would be Test, Staging and Production – reflecting the lifecycle of the configurations, rule sets and data involved in a deployment.

Audit Log

The internal audit log provides a view of all events relating to administrative objects. Events such as logins, logouts, user profile changes and so on are all logged here.

Proxies

Composable Agentic Platform is a very network centric product. As such there are a number of network level definitions that can be confusing - especially when it comes to the large number of potential proxies being involved. The following is a list of the terms used for various proxies within and used by the product.

Web proxy

A web proxy is a proxy server that is installed as a performance optimizer or security feature between Composable Agentic Platform and the World Wide Web. Typically, this is a corporate proxy that Composable Agentic Platform must traverse to access services such as the update server, MaxMind, SMS services etc.

Composable Agentic Platform built in proxy (forwarder)

The built in Composable Agentic Platform proxy, called Proxy Server in the console, is a forwarding proxy used to protect sites that are unable to take advantage of the inline filter (e.g. all non-J2EE sites). It is also used to test Composable Agentic Platform rules against any site, without performing any installation. The latter is achieved using the built in browser proxy.

Browser proxy

The browser proxy is a feature of the built in forwarding proxy. The browser proxy allows configuration of a proxy within browser settings and have the requests from that browser sent through the Composable Agentic Platform built in proxy. This provides a convenient method for testing rules against sites without installing any additional software.

Architectural Scenarios

Composable Agentic Platform can be installed in a number of different ways. The following section highlights the most common scenarios.

Command and Control

Each X Agent is controlled from a Composable Agentic Platform Console. The console is a web application. Any information pushed to the X Agent from the console is stored in the X Agent’s home folder. This includes any software required to execute the rules.

It is important to note that the X Agent and the console are autonomous entities. They do not need to be connected for the X Agent to execute rules.

Simplest Form

In its simplest form, the X Agent receives data from a file (CSV, XML, Spreadsheet) and processes it by interacting with databases, APIs or other servers.

The X Agent in this form requires very little other than a software platform with Java and the X Agent core installed, a configuration file and a home folder. Upon deployment of rules, the console will deploy all other required dependencies along with the rules.

Servlet Filter

Installing the X Agent as a Servlet filter in a Java Application Server (such as Jetty, WebSphere, JBoss etc.), is a common approach. In this scenario, the X Agent is acting like a Servlet Filter sees all requests coming in and has the ability to modify the request before it reaches the web application. Similarly, is also sees every response coming back from the web application and it has the ability to modify these responses on the fly.

Use cases for this approach mostly centre around making temporary changes to an existing web application, out of band of release cycles, or when the web application is third party and not able to be modified.

Examples of such temporary changes include:

Adding security such as CSRF or SQL Injection protection
Frequently changed compliance rules that can be required on short notice

API Transformation

In this scenario the X Agent is installed alongside the Composable Agentic Platform Application Server and they work as an integrated whole, turning the X Agent into a high-performance HTTP proxy with the ability to provide SSL termination and on the fly transformation of requests and responses between the existing applications and the existing APIs.

Use cases for this approach include:

Vendor abstraction, which provides the ability to create generic APIs for things such as text messages, geo-location, two-factor authentication and use those APIs in the existing applications instead of vendor specific APIs
API Sunsetting, where the X Agent is capable of transforming the structure of an API call between different versions, such as to facilitate the removal of older version code from the API server source code
API Accounting, enabling chargeback of API calls that have a monetary cost to the respective users of that API

Active Web Proxy

This scenario is largely the same as the API Transformation scenario from an installation perspective. The only difference is that the requests in this scenario comes from end users rather than server applications and the target for the requests are existing web applications or Software as a Service applications.

This scenario lends itself to a myriad of use cases:

Digital transformation, where an existing application (often beyond the control of the business) is functionally enhanced, without the explicit need for the existing application being aware of these changes
Bot management, where the X Agent detects bots and adds policies for how those bots are able to access the underlying application
Robotic process automation where requests to the existing application results in data also being entered into secondary systems
Orchestration of multi existing applications that are joined together to form a new experience
… and many more

This scenario is especially useful for regaining control of web applications that are otherwise difficult, expensive or impossible to change

Web Application Server

In this scenario, the X Agent is configured as a proper web application server. It is capable of serving up content, including multi-media and other web assets. Rules are used to create a dynamic user experience.

Use cases include:

Web forms that capture data on a single page and disperse it into one or more locations
Dashboards that capture data from multiple different sources and serve up a singular view for all those sources
Fully fledged stand-alone web applications

Active Proxy With Content

With the active proxy and the web application server used in combination, in this scenario the X Agent can be configured to act as a proxy that has the ability to add content.

Use cases include:

Customisation of the user experience of a SaaS application, without the knowledge of the SaaS provider
Mobile enablement of an existing application without any changes
Adding two-factor authentication to an existing insecure application
Short lived campaigns and surveys that would otherwise clutter the target application code

Mobile Application Server

In this scenario the X Agent is configured to work as an application server with the ability to create client-side rules for validation, data storage and other mobile device features.

The client-side X Agent is running entirely in JavaScript and is portable across any mobile device with a browser. It creates a native look and feel mobile experiences using pure HTML5, CSS and JavaScript.

Use cases include:

Rapid mobile application prototyping
Offline/Online mobile data capture

Asynchronous Multi-Protocol

The Asynchronous multi-protocol scenario operates at the network packet layer and expands the reach of the X Agent from HTTP into other protocols. TCP and UDP packets are supported.

To facilitate this approach a secondary protocol level engine breaks down the packet into elements the main X Agent can understand. The X Agent can then modify these elements and the underlying protocol packet will be changed accordingly. The X Agent is then able to forward the modified package to the designated network endpoint and can even in some instances commence a chat with the endpoint before forming a response packet for the initiating computer. As always, the X Agent can rely on secondary data sources (APIs, data, other systems) to help form the modified request and response packets.

Use cases for this scenario includes:

Database field level security
ATM stand-in
SCADA security
Advanced DNS
… and many more

Data Loss Prevention Architecture

In addition to protecting internal websites from attacks and fraudulent activity, Composable Agentic Platform can also be used to monitor employee’s interactions with external websites (such as social networking sites, blogs and wikis). The most common use of this feature is for internal users to limit access to specific sites (for example Facebook and Twitter) for business purposes.

The following diagram illustrates how Composable Agentic Platform can be configured to monitor all traffic going to one or more nominated sites:

The key to this feature is to introduce a second DNS server within the company infrastructure. This second DNS server provides an override IP address for sites that are monitored by Composable Agentic Platform, ensuring that all the traffic is visible to the appliance. The monitoring appliance can be hosted and managed by an external service provider or can be installed in-house.

Getting Started

In this section we assume that the Composable Agentic Platform console application has been installed or activated and that you have access to a URL that brings up the login screen. If this is not the case, please refer to Installation and Configuration for instructions on how to install and configure the product.

Login

When you first access the Composable Agentic Platform console you are presented with a login screen as shown:

You can select the preferred language to use for accessing the system. If you change the language, the login page will change accordingly. The language you select will be stored as a cookie within your browser, so you only have to select it once.

You can now sign in using the user ID and password provided by the system administrator. If you are the system administrator and this is the first time that you are signing into the system, you can use the word admin for both the user ID and password.

Note that both user IDs and passwords are case sensitive.

Essential Things to do First

Before you start using the product, there are a number of important tasks that you should perform to get the most out of the product, and to secure the console from unauthorized access.

Console Setup

The first step is to click on the "Administration" section of the console and select "Console Setup".

You will see a page that allows you to configure a number of console settings:

Decide the type of the installation

The first and most important thing to decide is the console type. Composable Agentic Platform ships as a single distribution, but it can be used in a variety of configurations. By selecting the console type, Composable Agentic Platform will delete elements that are unnecessary for the specific installation. The Demo Server is the most suitable type for training and testing, whereas the other listed types are all production configurations. Once you have selected the type you wish to use there is no undo. If you select the wrong type you will have to reinstall the product.

Set the console web proxy

Unless the system where you have installed the console has a direct connection to the internet, you will need to configure the console's web proxy. If you leave your console without internet access, then you will be unable to receive product updates, new extensions, case studies and fixes.

If you are behind a web proxy that uses Microsoft NTLM authentication, you must also set the Web Proxy Domain value. For Microsoft NTLM to work correctly the Web Proxy Host should be the fully qualified network name of the proxy,

for example: mywebproxy.mycompany.com,

and the Web Proxy Domain should be the simple name for the domain.

Set the console email server

Having an email server defined for the console is an essential step in ensuring that you can reset lost passwords and recover forgotten user IDs. It is also a requirement if you intend to use the email second factor login method.

Please note that unless a mail server is defined, there is no way to recover a lost password. It is often important to set the email sender as many SMTP servers are configured to reject email from unknown senders.

Save the settings

Once you have defined your console type, web proxy and email setup, make sure you click the save button to store the settings. If these settings have been edited, you must restart the console server.

Manage user accounts

After the console has been configured, you should ensure that the list of authorized users is correct by clicking on the "Administration" section of the console and selecting "Users".

Set up a new account

As a minimum, you should set up a new personal account. The User ID is required to be at least 6 characters long and may not contain spaces.

Supply your real name, email and set the user type to be Administrator. Administrators are not required to have a user role, but it is a good idea to provide your time zone as this will make reports and search queries match your local time.

Finally supply a strong password twice and click on Create.

At this point we recommend that you log out of the console and log in as your new user.

Manage default accounts

By default, the console ships with 3 active accounts: admin, super and security. All of these accounts have elevated access to manage the console and should not be left with default passwords (which for all of them is the same as the user ID).

If you decide to keep these accounts, as a minimum you should change their passwords and supply them with a valid email address that can be used for a password reset.

Keeping the Product Current

After you have completed the console setup, click on Console at the very top of the administration tree:

If there are updates, fixes, new rules, case studies or other material ready on our update server, there will be a message about those updates.

To see the updates available, simply click on the notification.

Please note that the updates available depend on the type of server licenses you have installed

The update screen will appear as shown below.

Depending on how your console was shipped, downloads will include new or updated rule examples, new or updated extensions and updates to demo and the console application itself.

Note that brand new updates are marked with the BETA tag for 7 days after their release. This is done to allow you to apply updates conservatively.

Installing updates

To install a given update, simply select it and click on Install selected (alternatively you can install all updates available by clicking on Install all).

If you choose to download a console update, please be patient as they can exceed 25Mb in size and can take several minutes do download and install. Once the download update is completed, simply log out of the console, wait around 30 seconds, and log back in to get the new version. Any users who don’t log out will simply remain on the old version until they do.

Note that if you install a new application version, you should always clear your browser cache to pick up any changes.

Updating when the console has no internet connection

In some instances, the console does not have direct internet access. To still facilitate the download of updates, Composable Agentic Platform can use a fallback mechanism known as CORS (Cross-Origin Request Services). This essentially allows the console to reverse proxy through the browser used to access it. At the time of release only the latest versions of Chrome and Firefox support this web technology.

Viewing Active Servers

Once you have logged in, you will have visibility of all of the Composable Agentic Platform servers defined within the console. You can see the status of each server by expanding the Servers section of the administration tree as shown below.

Servers with a green tick in front of them are recognized as being online and available. Servers with a red exclamation mark are offline and unavailable. To see the status of an individual server, click on the icon in front of it. An example of a server’s status is shown below.

Filtering servers

If you have a lot of servers, you can filter them by host name, port number or description.

The filter stays in place, not just for the active servers but also filters the list of servers accessible for deployment.

Quick Product Introduction

Now that your console is fully operational, we are ready to take you through a basic example that illustrates how to use it. Our example will show you how to remove all advertising from Google search results.

Even though this example has limited real world practical use (unless you wish to run it on your corporate internet gateway), it provides a basic case study that shows many fundamental features.

Preparing the Browser Proxy

The first step in our example is to prepare the browser proxy so that all traffic to and from Google is successfully routed via the Composable Agentic Platform Proxy Server. This will give us visibility of the data and provide all of the information we need to manipulate it.

Many browsers have in-built security features to prevent user access to websites whenever there is an untrusted SSL certificate, and will block the incoming request without exception

In our example, because it is not possible to install Google’s SSL certificate to the Proxy Server, overcome this by using redirection settings within the Proxy Server. In Administration, Server Definitions click on the Proxy Server as follows.

Click on the Forwarding tab and set the Request redirection properties for Google as follows. Our example is for a UK IP address request, which follows the redirect of Google.com to Google.co.uk based upon the IP geolocation from the originating browser.

The first line entry is for example format use only and has no impact on the Proxy Server:

http://thishost>http://thishost:8001
http://google.com>https://google.com
http://www.google.com>https://www.google.com
http://google.co.uk>https://google.co.uk
http://www.google.co.uk>https://www.google.co.uk

Once you have input the redirection settings, scroll to the bottom of the page and save the modified Proxy Server definition.

The Proxy Server will now successfully route the http to https protocol redirection and allow the browser to access the website even without a correct SSL certificate.

Next, deploy a configuration to the Proxy Server. The configuration we will use in this example is the one named BasicWebTrial, which is under Configurations->Product Trial in the administration tree:

When you click on it, you will be presented with a number of options:

At this stage we are not going to make any changes to the configuration, only the changes made earlier to the Proxy Server server definition.

So now deploy it and start the Proxy Server by clicking on Deploy.

You will see a choice of servers you can deploy to:

Select the Proxy Server as shown, check Restart immediately and then click Deploy. You will then see the action window switch to the server view showing the configuration and all of its dependencies being deployed to the proxy:

Once complete, you will see that the Proxy Server is started and ready to use:

Setting up the Proxy in the Browser

Now that the Proxy Server is running, the browser needs to be configured. There are a number of different ways of doing this, depending on the browser of your choice.

Our preferred method is to use one browser (e.g. Chrome or IE) for managing the console and another browser (e.g. Firefox) for browsing via the Proxy Server. The advantage of this approach is that Firefox has its own local proxy settings allowing us to run basic queries and other web browsing unrelated to our testing in the non-proxied browser.

Note: When using the Composable Agentic Platform browser proxy for accessing secure web sites over HTTPS, you will encounter certificate warning in the browser. These warnings are relatively easy to get around by clicking on the Advanced button and adding an exception. However, with the advent of HTTP Strict Transport Security (HSTS) this has now become impossible to do as the browser will refuse to add the exception.

The Browser Certificate Installation Guide (in the documents folder) provides instructions on how to overcome this problem by installing a trusted certificate authority into your browser that Composable Agentic Platform in turn will use to generate valid replacement certificates for each SSL site on the fly.

The following shows how to configure the browser proxy in Firefox Quantum 60.0 on Windows 2012 Server:\

Select Options then click on Network Proxy > Settings:

Set the proxy options as shown below:

Verifying the Browser Configuration

Now we can verify the browser and proxy configuration. In the browser you chose for browsing via the proxy, type www.google.com in the URL (address) bar and hit enter. You will see the country specific main Google page:

Now switch back to the browser running the console. You should see some activity in the server console viewer. You can enlarge the server console viewer to get a better look:

Without going into too much detail at this stage, what you are seeing is the browser request for each interaction that the browser had with the requested host. You can see items such as the IP addresses, User agent (Browser), Request URL, request method, cookies, protocol scheme etc. This is by no means an exhaustive list of the data Composable Agentic Platform can detect but gives you a general idea.

The thing to take note of at this stage is that you can see all requests, including requests for images as well as JavaScript, CSS and other page elements. This is an important thing to be aware of when writing rules.

Understanding the Configuration

It is now time to take a closer look "under the hood" to give you an understanding of what just happened. The first thing to look at is the configuration that we just deployed. Select it again for a closer look:

Configurations are what tie a solution together.

Each solution consists of a number of building blocks which can include several rule sets, data files, content files, database configurations, field settings, input source definitions and much more:

To learn what this configuration does, you can review each of the various tabs and look at each rule set. Alternatively, click on Document, and select a target server:

Select the Proxy Server and click on Document.

A new page will appear that contains a complete summary of the configuration:

This page is specifically designed for printing a given configuration for audit purposes, but it is also an excellent way to get a quick understanding of what is going on in a rule set. Just focusing on the rules in this case, scroll to the bottom of the document:

The rule set shown (BasicWebLister) is executed whenever a request is sent from the browser to the server. The rule set is effectively a flow chart, executing from the green dot on the left through the rules towards the right. This is a very simple rule set with no decisions, so the flow should be very clear.

The summary page below the rule set shows the properties set for each rule, but for the sake of understanding, we will elaborate a little further:

The first rule executed is the HTTP Request Tracker rule. This rule takes a basic HTTP request and extracts all of the common header attributes from it (header names, request URL, tracking cookies etc.) and places that information in variables. It also sets tracking cookies (if Use cookies is set to Yes).
The second rule is the MaxMind Geo Info rule. It uses the IP address supplied on the HTTP request and attempts to convert it to a physical location (country and city) using the MaxMind Geo Location database. In this case, the rule returns nothing, as the localhost IP address (127.0.0.1) doesn't resolve to any country in the data lookup.

Finally, the List Variables rule sends all of the variables that have a value to the server console viewer so that the user can examine them, which is what you saw earlier.

The purpose of the configuration we just deployed and tested is to obtain the HTTP request data, augment it with Geolocation and then send the information to the console. If you scroll down the server console viewer, you will notice the various requests coming in, including requests for images, style sheets, icons and so on.

Understanding input and variables

Every X Agent receives data in the form of variables. These variables are initially supplied by an input adaptor. The most commonly used input adaptor receives web application input, but other adaptors receive XML data, CSV data or other more complex input.

For the purpose of understanding the above rule set, the web application input adaptor supplies the variables REQUEST_URL, URI and REQUEST_TIMESTAMP. It also supplies as variables any parameters provided by GET or POST requests. To obtain more detailed information about the HTTP request, the HTTP Request Tracker rule is used

The reason for this separation is that you may not need all of the detailed information for most requests (such as images). This example provides a quick window into the world of Composable Agentic Platform. The next step is to create a configuration that will have a more interactive result.

Preparing a new repository

The first thing that is required for a new configuration is a new repository. All data, rules, content and so on, live within Composable Agentic Platform in a repository.

To create a new repository, click on Repositories, enter the name as “Google Ad Remover” and click Create.

This will create the repository. The next step is to figure out what our rules should do. This requires a closer look at what Google does with their search results.

Locating the Page to Modify

Start by actually running a query. For the purpose of this example, go to www.google.com and query the word dishwasher. You will get a country specific page similar to the one shown below. If you don’t see any ads at the top or on the right look at the bottom of the page. In our example, we are using www.google.co.uk.

The goal is to remove the ads along the top, and the ads along the right-hand side.

The next step is to work out how to go about removing the ads.

Determining the Actions Required

There are a number of different ways to work out what actions need to be performed in the rules. In this case, the only action is to alter the response, so we need to determine where to make changes. Browsers like IE, Chrome and Firefox all provide developer tools to help identify specific elements in the page source code. In all of those browsers hit F12 to access the debugging tool if using Windows. For other platforms, please check the browser help instructions for how to access the tool. They all work in a similar fashion, but we will just cover Firefox Quantum version 60.0 operating on Windows 2012 Server in this example.

Click on F12 to open up the Inspector:

Click on the html inspector tool:

Now select the sponsored ads box:

This is where it is useful to know HTML, especially when dealing with a multinational site such as Google, as the tags tend to change from country to country. In our example, it is worth noticing that there are various advertising tags output within the source of the page.

There is a DIV with ID “rcnt”. To make the ads disappear you need to hide the tag using inline css styles.

To accomplish this:

</head>

becomes:

<style type="text/css">#rhs, #tvcap {display: none;}</style></head>

With this information to hand, the next step is to start building a rule set.

IMPORTANT NOTE: Individual versions of Google will differ depending upon operating system, browser, and country. Make sure to work out the right way to make this modification in the version being used.

Building the First Rule Set

Normally the starting point for a new rule set is using the New rules wizard. We will cover that later, but for the purpose of simplicity this exercise will instead build a new rule set from scratch. Return to the console and click on Rule Sets:

In the action window select your new repository and give the rule set a name (in this case NoAds) and click on Create:

Note: The rule set name should always be a single word with no spaces.

A new rule set is created, ready for us to edit:

Click on Update to start editing the rule set. A pop-up window will appear showing the rule set in the Rules Editor:

Note: If no pop-up appears then check your browser's pop-up blocker. Pop-ups (though blocked by some users) are useful for the rules editor. It allows you to have many rules editing windows open at the same time and edit them all concurrently (including copying and pasting between them).

We encourage you to expand some of the elements of the Rule Catalog to see what is available. The complete rules reference is also available as a PDF document from the main console page.

At this stage you should also add a short description of what your rule set is going to do. Do this by clicking on the Rule Info tab and keying in a short description of the purpose of the rule set:

The next step is to start building some rules to handle the search result. The first consideration is that the rules should only apply to search results, not items like images, CSS and the like. Normally the New rules wizard would insert a special rule to take care of that problem, but with Google there happens to be a very simple solution: Any request that has a dot (.) in it, is sure to be non-HTML.

Other sites may use some other consistent extension for pages (such as .php, .jsp or .html), but for Google it is pages with no extension at all.

Therefore, our first task is to filter out all requests with a dot (.) in them, and to do this, we need a condition. Expand the Conditions group and drag an If Condition from the tree onto the canvas:

At this point, move your mouse over the If Condition on your canvas, right click it and select Help. The expanded help for the rule appears:

All rules have this help available. In addition, in the bottom left corner of the rules editor you will also see a summary help notice:

These help features are often useful when trying to find the best rule to suit a specific purpose (as some rules may sound very similar).

Manipulating the Server Result

The next step is to change the actual server response before it is sent to the user. In our case this change consists of the html string replacement we identified earlier. The rule for a string replacement is called String Replacer. Locate it and drag and drop it onto the canvas. How to connect it up should be easy now:

Notice that we connect both the Found and NotFound chain points to the following rule.

We do this because not all Google pages display ads. This time the properties are set as follows:

Returning the Result to the User

We have only one final step to complete the rule set, we must return the changed response to the user. This is done with the HTTP Response rule:

The final properties are set:

Our rule set is now complete, so save and exit the editor.

IMPORTANT: If you are using Google Chrome to edit the rules make sure to hit the Save button in the rules editor before closing the pop-up window.

Tips and Techniques for Working with the Rules Editor

The rules editor is designed to be easy to use. There are however a few tricks that will make using the rules editor even easier. The following covers a few of these tricks.

Selecting multiple rules

If you need to move multiple rules around in unison or select them so that you can copy them to another rule set, hold down the CTRL key whilst clicking and selecting rules. Alternatively, you can hold down the mouse key and drag a rectangle around the rules you wish to select.

Cut, copy and paste

Rules can be cut, copy or pasted within the same rule set or between open rule sets by right-clicking whilst the rules are selected. To paste the rules into a new position, right-click on the canvas where the rules should be placed and select Paste.

Note: Not all browsers support the right-click feature. For this reason, edit options can also be obtained by holding down shift and left-click.

Disconnecting chains

If you mistakenly connect one rule to another, you can remove the connection by right-clicking on the chain point and selecting Disconnect.

Distinguishing between variables, text and numbers

Many of the rules in the system take a variable, text or a number as a parameter. Generally speaking, variables may only contain letters, numbers and the characters: ‘_’, ‘:’ and ‘.’, and they must start with a letter.

There will be times where a variable can be confused with a text literal, and for that reason text literals should always be enclosed in double quotes. For example: ABC is a variable whereas “ABC” is the text ABC.

Numbers on the other hand are unambiguous and can just be keyed as numbers.

When entering a CSV list of values, there is no need to enclose the entire block in double quotes. Since CSV text has commas in it, it will automatically be detected as a list of string elements.

Don't close before testing

There is no need to close a rule set to test it. You can keep multiple rule sets open in multiple windows, deploy your rules, test and then return to the already open windows to continue editing. Click the save button rather than the close button to stay on the page.

Regular expressions

Several of the standard rules make use of a string matching feature known as regular expressions. Books have been written about regular expressions and it is beyond the scope of this manual to cover more than the very basics.

In its most basic form, you can use regular expressions to see if a certain text is available within another, to count characters and to look for certain pieces of text at certain positions within words.

An example of a regular expression would be: (ab|cd)

This expression will check if a text contains the character sequence “ab” or the sequence “cd”. So both “baby” and “lcd” would be a match.

The following tables list some of the common uses of regular expressions and how they can be used to validate text.

Character ranges

Counting

Grouping and alternation

Positioning

Lists and Data Sets

Lists and data sets provide an efficient way to work with keyed data that needs to be stored either in memory or a database.

Lists are capable of storing normal variables, lists or data sets, allowing effectively for multi-dimensional arrays.

There are two types of lists: Regular lists and fixed sized lists. When you insert a value into a list, the list will automatically be created as a regular list if it hasn’t already been created.

Fixed size lists are extremely useful for memory caching. Elements inserted into a fixed size list will stay there until the maximum size of the list is reached. Once that happens, the oldest element that has not been accessed (read or updated) will be removed from the list.

Elements inserted into a list must have a fixed key. If a new element is inserted with an existing key, the existing element will be replaced.

It is possible to create global lists by creating the list in a startup rule set and then setting it as a global variable. When the global list is read from a normal rule set, any changes made to that list as a local variable will directly affect the global list. This provides a means for caching data at a global level.

Data sets provide a way to create a collection of correlated data. For example, you can define a data set called “Fruit” with the properties Name, Color and Shape. Once you have defined a data set, you can create instances of it in a database or in a list. For example: Apple, Red and Round.

The X Agent will automatically handle the correct storage of the data set in a database and properties of the data set can be added and/or removed at any time in its lifecycle. So, if at a later stage you need to store another property in your Fruit data set, you can simply add it to the definition.

Data sets should be defined in a start-up rule set and can only be defined once within the life cycle of a deployment.

Data sets can optionally contain a number of lists. However, lists stored within a data set may only be single dimensional (you cannot have lists within lists).

Once a data set is stored within a database or a list, you can read it by key, delete it, update it etc. To update a data set within a list, you simply create it again with the same key name.

Using the New Rules Wizard

A quick way to get started building rules for a new web application or for stress testing is to use the rules wizard. The rules wizard uses live test data to extract URIs visited and build a structured collection of rule sets.

Selecting or creating a repository

The rules wizard creates a large number of files, so we strongly recommend that you create a new repository to write the new files to.

Preparing the data for the wizard

Before using the rules wizard, you must first deploy and start either the RuleWizardConf or the StressTestConf configuration found in the Rules Wizard repository to a test server that is protecting the target application (alternatively you can use the zero installation test method which is covered later in the manual). Once done, simply begin navigating the various components of the target web application.

Once you have visited all of the pages you wish to cover with your initial rule set, return to the console and go to the server status screen:

In this example the Qwerty demo application was chosen and there are 19 test records ready to be processed.

Creating the rule sets and configuration

Now click on the "New rules wizard" button. You will be presented with the following page that controls the wizard:

At this point you can have the rules wizard create a filter for the rules that automatically exclude static content. It is a comma separated list and you can easily add new elements.

Make sure that you select either New rule set or Stress test, depending on your requirements.

When you have selected an appropriate repository (in our case, we have use the repository ‘Name’) and reviewed the exclusion list, click on "Create".

After a brief pause, the X Agent will write out a complete configuration and collection of rules. The following pages show how these rule sets are structured for a new rule set:

Understanding the Load page

In keeping with best practice for rules writing, the rules wizard always creates a "Load" rule set. This contains rules that are generic for all URIs. The name of the page will be the name of the repository followed by the word "Load".

This rule set will first check for malformed HTTP requests and, if found, will reject them. Subsequently it filters out static content, adds a tracker rule and then proceeds to the main rule set.

Understanding the Main page

The main rule set page contains a basic structure that determines the URI being visited and then uses a switch to re-direct to a rule set covering that URI.

Each one of those rule sets, in turn, are blank, but are already created and ready to have rules added to them. The new rules wizard creates a quick foundation to get you started with writing rules for your application.

Using the page hints

When the rules wizard creates each of the blank templates it includes sample information of the fields gathered and their values in the page description:

You can use this information to identify the fields available to your rules.

Using the New Rules Wizard for stress testing

As described above, an alternative use of the New Rules Wizard is to create a set of rules for stress testing an application. To do this, you must first deploy the StressTestConf configuration from the Rule Wizard repository to the web application on the browser proxy.

Preparing the data for the stress test

Once deployed, you should work through the application to be stress tested, step by step. Try to complete pages as normally as possible, making sure not to pause unnecessarily during the process, as wait times are recorded.

During the new rules wizard creation, select Stress test instead of New rule set.

The following pages show how the rule sets are structured for a stress test:

Understanding the Load page

This rule set allows you to set up a multitude of things. First of all, the target server identity, and also the user agent to use. It is very common to modify the Load rule set to pick up randomized values for elements such as users. The following shows a modified Load rule set to include diversified User ID and Password configurations for a Qwerty stress test:

In this scenario, the users and their passwords are picked from a CSV file of valid entries. The settings for the CSV Line Picker rule are as follows:

It is important to notice how the THREAD_NO variable is used to pick a line in the CSV file. The THREAD_NO is incremented for every stress testing agent and should the number exceed the number of entries in the CSV file, the rule will start reading from the beginning again.

Understanding the Main page

The main rule set page contains a basic structure that determines the URI being visited and then uses a Number Sequencer rule to re-direct to a rule set covering that URI.

Understanding the Wait page

The wait rule set page is used to define wait times. This delay can be fixed, or it can be a function of the recorded wait time during rule set creation.

The following shows a wait time delay:

The wait time properties can be set as follows:

The default for the percentage random is zero, but it can be randomized to create a more realistic user load.

An alternative is to either remove the wait time completely or insert a fixed delay:

Understanding the Fail page

The fail page is used to define any action to be taken if a stress test page invocation fails. The default is to abort the flow and stop the thread:

Understanding each stress page

Each page that is detected will be assigned a unique page name and will be given a unique rule set. The following shows the main page in the Qwerty application:

In this case, the page is a GET and the invocation is relatively simple. For POST requests, the structure looks slightly different:

In this case, the recorded POST variables are set up in a single rule. You can override these POST variables to use pseudo random values (as shown previously in the Load rule set).

The individual pages are where you should include test data, and possibly review the response data from significant pages to ensure that the stress test is progressing well.

Configuration Settings

Each configuration is divided into 6 tabs with settings. In this section we cover these tabs in detail.

The General tab

The General tab contains all of the basic information about a configuration.

File name

The file name should be a single word (no spaces). You can rename the configuration by changing the name and clicking "Save".

Description

The description provides an easy way for other people to understand what your configuration does and serves as basic documentation.

Rule Set

Every configuration must have a rule set. This is a mandatory field.

Content Rule Set

Content rule sets allow you to manage specific content (new pages, images and so on), that you can introduce to the application. If the target server definition has a context path set for serving content files, you can optionally check the Use server context path checkbox to use that as the defined directory path.

Startup and completion rule sets

If you have rules that should be executed before your main rules, or immediately after they have all completed, you can define them in the configuration.

Please note that if you set a startup or completion rule set, the X Agent will be restricted to running in a single thread to ensure the correct order of events.

Modes

Modes are named collections of rule sets and content rule sets that can be used to replace the default rule sets that are running. For example, if you wish to take a website offline for maintenance, you can create a “Maintenance” mode and assign it to rules that display a maintenance page instead of your normal website.

The Input source tab

The input source tab provides details to the X Agent about where input is coming from and how to deal with it.

From Server Type

The server can be either Production, Multi-Protocol, or Test. This makes the configuration target a specific server type and determines which input adaptors are available to select from.

Input adaptors

A critical part of the configuration is the input adaptor or “source of data”. The options available depend on the type of server selected. As a general rule, the file name or URL being processed will be made available by the input adaptor as the variable URI (Uniform Resource Identifier).

For file names, this includes the full file path in the file system dependent format.

Input adaptors are frequently added via extensions. At the time of writing, the following input adaptors are available by default:

Execute a load test against a server: This input adaptor is only available for production servers. It allows you to take a stress test rule configuration generated by the “New rules wizard” modified to suit the application and use that to generate a load against a website. You can control ramp up times and total threads as well as think times.
Please see the “Using the New Rules Wizard for stress testing” section in this manual for more information.
Process multi-protocol input: This input adaptor is only available for Multi-Protocol servers.
It allows you to take input from any protocol defined within the administration section of the console and control the input, proxy and output of that protocol.
- Protocols supported include (but are not limited to): MySQL, DNS, Telnet, FTP, ISO8583 and SMTP.
- Transports include: SSL, TCP and UDP.
  Please see the “Case study: Multi-Protoco l” section in this manual for more information.
Process a single CSV file: This input adaptor is only available for test servers. It allows you to define each column in a CSV file that you wish the server to process. The file must be present amongst the test data files uploaded to the console.
Process a single multiline CSV file: This input adaptor is only available for test servers. It allows you to process CSV files that have records spread over more than one line. Typically, this could be a file that looks as follows:

Record ID

Record Type

Value Column 3

Value Column 4

12345,

R1,

John

Doe

12345,

R2,

Melbourne

Australia

23456,

R1,

Bob

Smith

23456,

R2,

London

United Kingdom

34567,

R1,

Jane

Doe

34567,

R2,

Auckland

New Zealand

To process the above file, you would need the following definition in the Input Fields tab of your configuration:

The break column (BREAK_COLUMN) defines which column number is used to identify a unique record. The record column (RECORD_COLUMN) defines which column number contains the record type.

Each individual field for an entire record is then defined as a field name, with the label indicating which record and column number that field can be found in.

Process a single identifier delimited file: This input adaptor was designed to rapidly traverse files that contain somewhat structured data, where each piece of data is preceded by a recognizable identifier, and all of the identifiers are in the same order (although missing identifiers are tolerated).
This adaptor relies on the data in the file containing a format where an identifier can be used to spot breaks in the data. The following example illustrates how this adaptor can be used:

Sample data that could be provided to this adaptor could be:

Card Number: 12345678910 Amount=123.45 Order=12345
Card Number: 34567891012 Amount=56.78 Proforma=67890

The first label listed in the input fields for the configuration MUST be the break for each new record. A record can be on more than one line in the file.

Processing the above shown data would result in two records, the first with the variables set as follows:

[CARD_NO]=[12345678910]
[AMOUNT]=[123.45]
[ORDER_NO]=[12345]

The second record contains:

[CARD_NO]=[34567891012]
[AMOUNT]=[56.78]
[PROFORMA]=[67890]

Strings are always being trimmed of leading and trailing blanks but can contain more than one word. If you have identifiers in the file that you wish to ignore, you must still specify them in the list or they will be considered part of a context of a previous identifier.

Process a single XML file: This input adaptor is only available for test servers. It allows for an XML file to be processed by the X Agent. Each XML tag in the file (and its attributes) will be converted to a unique variable name. For example, the following XML document:

<helloWorld world="”Earth”">
  <name>John Doe</name>
</helloWorld>

results in the following variables being generated:

helloWorld.world=Earth
helloWorld.name_1.text=John Doe

It is important to note that all tags below the root tag will have a counter attached to them to ensure uniqueness. This is what results in the “_1“ being added to the “name” tag in the example above.

If more than one “name” tag is present, the conversion will be as follows:

<helloWorld world=”Earth”>
            <name>John Doe</name>
            <name>Jane Doe</name>
</helloWorld>

Which results in the following variables being generated:

helloWorld.world=Earth
helloWorld.name_1.text=John Doe
helloWorld.name_2.text=Jane Doe

As this process can result in some rather long variable names (especially when processing XML documents such as SOAP requests), the use of the Alias rule is encouraged to simplify rule writing.

Process all CSV files in a directory: This input adaptor is only available for production servers. This input adaptor will look for files in a folder/directory. When one is present, it will process it and then delete the file. Each field in any supplied CSV file must be defined in the configuration.
Process all identifier delimited files in a directory: This input adaptor is only available for production servers.
This input adaptor will look for files in a folder/directory. When one is present, it will process it and then delete the file. The data within the supplied file is converted into unique variable names as outlined in the “Process a single identifier delimited file” adaptor.
Process all multiline CSV files in a directory: This input adaptor is only available for production servers.
This input adaptor will look for files in a folder/directory. When one is present, it will process it and then delete the file. The data within the supplied file is converted into unique variable names as outlined in the “Process a single multiline CSV file” adaptor.
Process all XML files in a directory: This input adaptor is only available for production servers.
This input adaptor will look for files in a folder/directory. When one is present, it will process and then delete the file.
The tags within the supplied XML document are converted into unique variable names as outlined in the “Process a single XML document” adaptor.
Process free format test data: This input adaptor is only available for test servers.
This adaptor is specifically designed to receive data from a file generated by the “Test Data Creation” rule (TST files). There is no need to define any input fields as the data within the file are composed of a field definition list as well as a data value list for each record.
This adaptor is designed to process data from production servers that process web application inputs with the actual variable names changing for each request.
The test server will be able to emulate what happens on an actual production application server without the requirement to simulate anything in a test environment.
This adaptor is also useful for pre-testing any new rule set to evaluate the impact of installing it into production.
Process on heart beat: This input adaptor is only available for production servers.
This input adaptor is used to process the same rule set at regular intervals. You can specify the delay between each run in ms.
Process once and stop: This input adaptor is only available for production servers.
This input adaptor will run the rule set once upon startup and then stop. This is predominantly used for testing rules.
Receive input via HTTP POST: This input adaptor is only available for production servers.
This input adaptor is designed for high-speed processing of a specific HTTP POST (for example from a known JSP or HTML page). Each field that the X Agent is expected to process must be defined in the configuration, just as if the input came from a CSV file.
The field names listed must be the same name (case sensitive) as they appear in the form post from the HTML that submits the request.

It is important not to confuse this adaptor with the “Receive web application input” adaptor, which is slightly slower but much more flexible.

Receive web application data: This input adaptor is only available for production servers.
This input adaptor is probably the most flexible, but also most complex. It is capable of receiving data from any HTTP request, be it a GET or a POST, and translate it into variables that can be used by the X Agent.
The adaptor understands and translates standard HTML, XmlHttpRequest (AJAX) and SOAP requests as long as the appropriate content type is set in the HTTP request.
For HTML POSTs and GETs, the URL parameters and form fields are translated directly into input variables, with each variable name matching the corresponding parameter or field name. For XmlHttpRequest and SOAP requests, the tags within the supplied XML document are converted into unique variable names as outlined in the “Process a single XML document” adaptor.

This particular input adaptor allows you to enforce some web application security settings:

For HSTS:

At the very minimum you must provide a ‘Max age’ value in seconds. The most recommended value is 31536000 seconds (one year).
Optionally you can check the box to include sub domains.
Preload is a method whereby the most common browsers will load a list of sites that MUST use HSTS. Google maintains a list of sites that are preloaded as requiring HSTS and that list is used by the Chrome, Safari and Firefox browsers. To have your site registered as preload, you must apply here: https://hstspreload.org/. Google will verify that you have the preload flag set against the HSTS header before adding you. If you do not have this flag, Google will reject your application to be added to their list.

Note: Any of the above settings that modify cookies require that you are running on a Servlet Specification 3.0 or later web application server. For the standard installation that means Jetty 9 or later.

Input adaptor specific fields (Production)

After the selection of an input adaptor in a configuration, there are a number of fields specific to that adaptor. The largest difference is typically between test and production adaptors, we will show two examples here:

The above scenario is for a production input adaptor that processes all files "dropped" into a given directory. The input adaptor will poll that directory and whenever a new file is added it will be processed and then deleted.

The additional fields are as follows:

Collect Test Data

Selecting this option causes the X Agent to always collect test data by default. You can also start collecting test data on demand using the server status view.

Max Test Records

This is the maximum number of records that the X Agent will keep in memory. For data with a large record size, this value should be set properly to avoid retaining too much memory.

Sub-directory

This is the sub-directory from the home folder where the X Agent will look for files. For other input adaptors that do not use files (such as the web application adaptors) this can be a different field name providing different information.

Auto Start

Selecting this option will cause the X Agent to automatically start when the server is started. There is no need to selectively click the start button.

Input adaptor specific fields (Test)

Test servers behave differently to production servers in that they always take a single file as input data, process that file and then stop. The following is an example of the settings for a test input adaptor:

Test data

Test data is the name of the file to process. This file must be in the test data section of the console tree in the same repository as the configuration.

Testing flags

The testing flags are used to control how the X Agent interacts with the environment around it.

Flag

Usage

Update Internal Data

If selected, the rules in the X Agent will update data with tables that it can directly access. This includes the internal case manager.

Update External Data

If selected, the X Agent will write data to external systems that are not database connected. This could, for example, be an external case manager that receives cases via a web services call.

Send Alerts

If selected, the X Agent will send alerts such as emails, SMS messages and other forms of external messages.

Remaining input source fields

The remaining input source fields are generic. The following is a list of their meanings:

Echo console to System Log

This flag determines if messages written to the console (via any of the List rules) are also written to the system's standard out log.

Enable Debug Mode

Enable Debug Mode turns “List Variable for Debug” rules as well as “Exit” rules with “List Variables on Exit” set to “Debug mode”, on, so that all variables will be listed at selected points throughout the rule sets.

Fail open on fatal error

The Fail Open setting determines how the X Agent deals with a fatal error. If selected, the X Agent will automatically stop and let all normal traffic proceed transparently should a fatal error occur. If unselected, the X Agent will attempt to recover from the failure and continue running.

Maximum chain events before run is considered looping

This setting is used to control how the X Agent detects infinite loops. Effectively every connection (chain) between rules has a counter built into it. When the number of chain events reaches the count set here, the X Agent will consider itself looping and will terminate to avoid impacting other services.

Performance collection level

This setting determines how much performance data is collected as part of the X Agent execution. Please see the performance data section for more information.

The Input Fields tab

Input fields are used for a variety of purposes. They can be used to identify column settings for input adaptors and also to determine global settings that can be changed at the configuration level without changing any rules. The following shows such an example:

The Global Fields tab

When the X Agent is running, it is possible to set global fields. These are fields that can be accessed by the X Agent at any stage and are not dependent on input from other sources. Global fields can be changed during the execution phase of a rule set, allowing you to potentially alter the flow of rules, set different thresholds or check for different conditions.

The following shows an example of defined global fields:

It is important to know that global fields are persistent. This means that the default value set in the configuration only applies for the very first time a global field is set. After that point, the global field retains its set value, even after the X Agent is restarted.

The field name is the global variable name set when the value is changed. The Label is the label that the user sees.

The field type for each global field is important, as are the allowed values. You can set fields types as follows:

Text: This creates a simple text field that can be changed. The allowed values have no effect.
Number: This creates a simple text field for numeric input that can be changed. The allowed values have no effect.
Switch: This creates an on/off style switch that can be changed. The allowed values represent the values set for the ON condition, followed by the OFF condition.
Slider: This creates a slider that can be changed. The allowed values represent the min value, max value and optionally a third value representing the increment. Only integers can be used.
List: This creates a drop-down list of values that can be picked. The allowed values represent each of those selections.

The following shows an example of how the above defined values are displayed in the server settings change function:

The Databases tab

Many of the rules available in Composable Agentic Platform are capable of accessing data in databases, either locally to Composable Agentic Platform or externally stored somewhere within the network.

As a user, you will need to know the name of the database that you want to connect to, and in some cases also the table and schema names.

Configurations need to list all of the databases that the rules within them are capable of accessing. This allows the deployment system to provide

Additional information to the Composable Agentic Platform servers about how to access those databases at a technical level. The databases themselves are normally defined by a system administrator.

The following shows a configuration of a sample database:

You must enter the database name and the type of database (driver). The list will vary depending on the types installed on the network.

If you are writing rules that may be used on different systems where the database names may differ, you can use a database alias name. The database alias name in the rule sets will be mapped back to the database name defined in the configuration.

There may be times where you wish to access to a database with a schema name that varies between test and production systems. To allow this, you can override the schema name in the configuration. If you leave the schema name blank, the default value configured by the system administrator will be used.

The system defines where the actual database is located. You can combine this with the defined servers to allow JDBC drivers to connect to any given location.

The Timers tab

You have the ability to set a list of rule sets that execute at a given time interval. These rule sets are independent of any input data and simply run on a repetitive cycle. There are two types of cycles: Delay and Real Time:

Delay timers simply execute the rule set, then sleep for the delay period and then execute again.
Real time rule sets will run at a precise interval, regardless of the time it takes to execute the rules.

An example of a timer setting is as follows:

In the above example, the OnTimer rule set will be executed, then pause for 30 seconds and then repeat.

Please note that the data object used within the timer stays the same. This means that you can set variables within the timer rule set and use those same variables the next time the rule set executes (for example variables for a counter).

Note that for web applications, the timers will not start until the first real transaction has been processed.

Creating documentation

For each configuration you create, you have the option of producing a complete documentation set. To create this, click on the Document button:

Then select the target server and click on the Document button.

A pop-up window will appear that lists a complete view of the configuration, the selected server, JDBC drivers, databases, rule sets, data files and so on.

Selectively you can include the actual details of the contents of data files and content files. For each of the respective files, simply tick the check box as shown below and click on Save.

IMPORTANT: The output in the pop-up window is very browser dependent and the quality of the results may vary. We recommend using the print preview option in individual browsers to see the final result.

At the time of writing the following conditions applied:

Firefox 11.0 produced a fairly faithful representation of the intended output but was slow.
Internet Explorer 9 produced a reasonable representation of the intended output, was faster than Firefox, but did not always respect page breaks.
Chrome 18 produced a terrible graphical look as it does not print background images in pages but was otherwise fast and true to page formatting.

Web Application Rule Set Patterns

The Composable Agentic Platform inline filter and built in proxy offers the ability to wrap and control the behaviour of a Web application. Usually this is done in conjunction with the Receive Web Application data input adaptor.

Rules in this environment can see all of the post data from the Web browser. By using rules such as the HTTP Request Tracker, HTTP Header Reader and other various rules for interacting with the application’s session object, it is possible to do a lot of additional pre-checking of the request being forwarded to the application before it is even processed.

Rules such as the HTTP Request Saver and HTTP Request Restorer also allow for requests to the backend server being temporarily placed on hold, pending actions by the X Agent.

When writing rules, it is a good idea to keep in mind that you may wish to test your rules without the application server being present. To do this, it is usually a good idea to create a Load rule first, which reaches out to all of the elements of the Web application, the request, dynamic databases, and so on to collect all of the data required for a decision before embarking on making that same decision. By doing so, you can insert a Test Data Creation rule at the point where all of the data is ready, thus allowing you to properly test the business functions of your rule set separately on a test server with all of the relevant data being available.

If you need to look at the output from a given request, insert an HTTP Server Execute rule. This will forward the current request to the server for processing and bring back the resulting response in a single variable. You can then scan that variable for information (for example, look for a balance value or a name) using the Scanner rule. This rule allows you to define the text surrounding information that you are interested in, and then extract that information into a variable.

Similarly, you can manipulate the response before forwarding it to the Web browser. As the response is simply stored in a variable, you can use the Replacer rule to modify text within given tags or specific locations inside the response page.

Once you have finished working with the output from a request, it is imperative that you forward it to the Web browser using the HTTP Response rule.

Finally, there may be situations where you wish to simply append additional data to the end of a Web page before it is transmitted to the user. Typically, this would be in the form of JavaScript appended to the end of the page. The best way to do this is to store the data in a data file, upload it to the server, read it into a variable using the File Reader rule, and append it to the server response using the HTTP Response Addition rule.

This section showcases a number of common rule set patterns used when working with Web applications.

Starting out (passive listening)

One of the first problems always facing you when you start working with a new application is the ability to understand which data flows where under each circumstance. The following rule set pattern, which includes comments, provides a good starting point:

First, the HTTP Request Tracker rule takes care of getting the browser information and adding tracking cookies. Second, a fast lookup to the MaxMind geo location database (which requires a subscription) identifies the origin of the request. Third, the request is written to the console so that you can monitor it in real time. Finally, the data is written to the test data queue so that you can download it for analysis.

This rule is best deployed to a test system during the initial rule writing phase to better understand what variables and pages are available. It is prudent to deploy it initially during a live install to retrieve a large portion of live test data and play it through the desired rule set on the Composable Agentic Platform test server.

Filtering out static content

A common issue in dealing with application servers that not only serve dynamic content, but also static data (in the form of images, style sheets, fixed HTML), is to filter this content before it even hits the core rule set. This is best done in the Load rule using a Name Splitter and Switch rule as shown:

The name splitter conveniently extracts the extension of the object being requested using the following properties:

The Switch rule operates on the EXT variable. By adding new chain points for each type of static content they are eliminated from reaching the rule set.

Timing a form

It is often a good idea to know the time a user has spent on a form. This is the foundation for filtering and/or slowing down “screen scrapers” or “data extraction” bots. This is an example of what a browser timing rule set looks like:

The basic concept is to first check if a session is present. If not, this rule set does not proceed. In some sites, this may be overly simplistic and may require modification, but for most sites it will be valid.

The rule set then goes through a series of checks. It reads the last time any request was made to the application, timestamps the current request and stores it. If it is possible to measure a time delay (via the previous timestamp), the method is a POST and the delay (in this case) is less than a second, an attack is assumed since no human can complete a form in less than a second.

A more sophisticated version of this rule set would include a CSV lookup to a list of known forms and the estimated time required to complete them. Based on that, a very effective defense against scripted readers can be mounted.

For reference, the rule set properties are listed below:

Collating data over multiple pages

In many instances, it is preferable to collate data for decisions over the course of multiple pages. The best way to do this is to use the HTTP Session Writer rule. The rules allow you to specify a list of variables and a list of corresponding key names so that they can be stored in the application server’s session.

The application server’s session provides a convenient place to store data that should only live for the time of the user’s online experience. As the application itself also has access to the session and can set its own keys, it is a good idea to choose key names that are unlikely to conflict with the application. For example, do not use keys such as “user” or “balance”. Instead, use “tomorrow_user” or “tomorrow_balance” (or some other unique prefix).

When the time comes to obtain all of the data in a single request, use the HTTP Session Reader rule. Specify all of the keys names you wish to read and the corresponding variables to restore them to, and you will have all of the available data required.

Serving up a page not known to the application

At times you may wish to serve up a Web page that is not known to the application. Examples of this include a two-factor request page, a challenge page or an information page for a rejected request. The easiest way to do this is to use a content rule set in the configuration, which will handle the delivery for you.

An alternative to using a content rule set is to create a template HTML document, upload it as a data file and deploy it to the target server.

Once it is deployed, it can be read using the File Reader rule. Next, have dynamic contents inserted using the Tag Replacer rule or the String Replacer rule. Finally, the HTTP Response rule can be used to serve the page back to the user. The following pattern shows this in action:

Creating links to pages and content now known by the application

There will be many times where you may wish to create a specific link to a page that does not form part of the application. You will only need to do this for application servers that do not allow you to control content via the content delivery rules. If you are in that unfortunate situation you can use the following approach:

You will need to “piggy-back” onto an existing page using URL parameters.

For example, the main page of an application could be “main.jsp”. However, by appending URL parameters to a link (for example as http://myapplication/main.jsp?ShowGif=penguin), you can use the following rules pattern to detect not just images and display them on request, but also additional pages that you may need to link to.

This pattern effectively sits ahead of the normal rule set for that page and allows you to serve up anything you need. The Switch rule makes it easy to handle multiple different files.

The properties for the first rule are:

The Switch then operates on the ShowGif variable. The File Reader then reads the correct file and sends it back to the user.

Adapting templates to style sheets

Once your application starts to deliver custom content, you would generally want it to “look and feel” the same way as the application it becomes a part of. The best way to manage this is to use style sheets. Many applications already have style sheets in place, and provided your new page is served up within a frame, it will automatically be applied to the new page.

However, if your page must stand alone or if it contains specific structures that are not covered by the standard style sheet, you may need to add style sheet tags to the template or an import reference to the applications style sheet. The HTML syntax for this is as follows:

<link rel="stylesheet" href="media/style.css" type="text/css" />

The name (href) of the style sheet will depend on the actual application, and some applications have more than one style sheet. The best way to find the ones that apply is to view the source of one of the pages within the application.

Forwarding the request to the server

Sometimes you may wish to allow the server to execute the request from the user so that you can look at the response it provides. The HTTP Server Execute rule provides the means to do that, as shown in the following example:

Note that this example also inserts extra data into the response. The next section covers this in more detail.

Manipulating the response to the user

Once the response data from a HTTP Server Execute rule has been obtained, you may wish to alter it before it is forwarded to the user. Examples of this include removing high-risk features if the user is coming into the application from an anonymous proxy or a country known for high levels of risk or add a picture (such as for advertising).

Composable Agentic Platform includes a number of string manipulation rules to make this task easy. The following shows the properties for the example mentioned above. It alters the response by adding an image to the page:

You can’t see it in the above example, but the full replacement text property is:

"<img src="main.jsp?ShowGif=penguin" />"

This maps back to the much earlier example of creating links to pages or content now already known to the application. The above will cause the browser to make a second request on the application server for the page URL shown. You can intercept the request and use it to return the image to the browser.

In this particular case, the image is inserted into the HTML in a spot that looks like this:

<ul>
  Your current account balance is $123.0
</ul>
ß image will be inserted here
<table width="100%"></table>

Taking charge of the application flow

When you detect a condition that requires action or further user input (such as a two-factor input request), your best option is to redirect to a “piggy-back” page as described above. You can do this by using an HTTP Redirect rule.

Please note that this cannot be done after an HTTP Server Execute rule. This is because the server has detected content already being written to the response and will no longer allow redirection.

The workaround for this is to send a response to the browser that causes it to redirect instead. This can be done with the HTTP Response rule, sending back a line of text as follows:

"
<script>
  window.location = "main.jsp?LoadPage=twofactor.html";
</script>
"

This pattern once again illustrates the “piggy-back” pattern in action.

Using Flight Recorders for basic web stats

Flight recorders can be used for more than logging of critical events for forensics. The following is an example of a flight recorder used simply to record web stats for a specific page:

The properties for the Flight Recorder Trigger are as follows:

The rule will trigger a single record into a flight recorder, giving you information about the user, browser, country or access, referrer and any other fields that you may wish to store. This can later be used to graph access to your application and give you valuable development and marketing feedback. Please see the “Working with Flight Recorders” section later in this book for more details.