Capri Overview
| Capri Uniqueness
| Capri Key features
| Capri Userbase
| Capri System
Requirements
Capri Key Features
Capri has a number of unique
features that have helped it maintain its innovative edge
over other Sequence Detection Algorithms. These include,
but are not limited to: Highly
Scalable
Capri has been
successfully applied to gigabytes of data consisting of
millions of records.
Ability to incorporate
Domain Knowledge
The user can define a taxonomy
on each of the attributes describing the events from which
the sequence patterns are to be discovered. This allows
sequences to be discovered at varying levels of generalisation.
back
to top
Template description language
Sequence Discovery Algorithms
in keeping with the Apriori family of algorithms can discover
a large number of sequences. Capri provides an XML based
language to describe the patterns of interest in a specific
discovery run of Capri. Only sequences matching the templates
defined are discovered.
back
to top
Discrete and Continuous Events.
Sequence detection
algorithms generally deal with discrete events. Capri
has been extended to be usable will continuous valued
series and takes input data in the form:
| Date/Time |
Commodity
1 Price |
Commodity
2 Price |
| 08/09/2001
09:35 |
27.59209 |
25.61969 |
| 08/10/2001
09:35 |
28.11783 |
26.09171 |
| 08/13/2001
09:35 |
27.85907 |
26.09961 |
| 08/14/2001
09:35 |
27.94062 |
25.87783 |
| 08/15/2001
09:35 |
27.62899 |
26.04845 |
| 08/16/2001
09:35 |
27.37411 |
25.6124 |
| 08/17/2001
09:35 |
26.85992 |
25.29884 |
This makes Capri applicable
to the financial sector where data is generally a continuous
valued series.
Multi-Variate sequence detection
Capri can
discover sequence pattern in data where there is more
than one attribute describing the event and the goal
is to find sequence patterns relating to the occurrence
of particular attribute values within each event.
back
to top
PMML and XML
Capri provides
XML representations of the data input and results that
ease its integration with other software systems.
PMML is a standard developed by Corporate Intellect
in collaboration with Oracle, IBM, SPSS, SAS, Magnify
and other key data-mining vendors. The goal of PMML
is to represent knowledge discovered by data mining
algorithms in an open XML-based standard enabling interoperability
of models between data mining vendors, knowledge generation
providers and knowledge consumer software systems in
general.
back
to top
Complex Sequence Pattern
Discovery
Capri provides
the widest range of parameters to its users allowing
them to have the flexibility to discover more specific
sequence types based on their needs.
|