Oracle Data Mining Concepts 10g Release 1 (10.1) Part Number B10698-01 |
|
|
View PDF |
Some data-mining enabled applications have models that are developed on one system and then deployed to other (production) systems. Production applications often need only to apply models built elsewhere. The Oracle Data Mining Scoring Engine supports scoring data (applying models) using models created elsewhere.
The Scoring Engine allows customers to limit the Oracle Data Mining (ODM) functionality available within their scoring applications to ensure that compute-intensive operations such as model building are not performed on systems where Scoring Engine is installed.
The ODM Scoring Engine supports operations for preparing data as required from the build process, importing a model, and applying a model to data. All transformation functionality is included in the Scoring Engine. All functionality provided in the Scoring Engine behaves exactly as the full system.
You cannot build models using the Scoring Engine.
"Data Mining Scoring Engine" is a custom install option for Oracle Data Mining; select this option to install the ODM Scoring Engine.
A single model can be used to score large volumes of data, often in multiple geographically distributed application settings. Data analysis and model building might be performed by a small group of data mining experts using data from a centralized data warehouse. However, the model, can be used by a much larger number of applications working with data at geographically dispersed sites using local data. Local data may consist of millions of records representing customers; therefore, it can make sense to move the model to where the data is.
In real-time applications such as call centers, models are often built in one environment and used in another. There may be one machine dedicated to model building, using large volumes of data to produce models on a daily basis. Several other machines may be dedicated to real-time scoring, receiving new models to support, for example, the call center application. Call center representatives collect information from callers; the collected information is then used to obtain predictions or recommendations for that particular caller in real time. Scoring in real time often requires that the model is moved to where the data is.
Oracle Data Mining supports two ways to move models from one schema or debase instance to another:
You can transport Naive Bayes and Association models between ODM instances using the Export task followed by the Import to import. The PMML string produced by the Export task can be moved via file transport mechanisms for import at a target ODM instance or Scoring Engine instance. The ODM Export Task places the PMML string in a table in a cell of type CLOB, The Import Task reads the PMML string from a table cell of type CLOB. The sample program PMMLDemo.java
illustrates exporting and importing a Naive Bayes model.
In a narrow sense, native export and import implies moving data mining model out of and into Oracle databases using Oracle proprietary facilities.
Native export and import are currently supported for data mining models built using the ODM PL/SQL interface only, namely those models created using the DBMS_DATA_MINING package.
Naive model export and import is based on Oracle Data Pump technology.
Native export is supported at three different levels, as follows:
expdp)
, all data mining models in the database are exported.expdp)
, all data mining models in the schema are exported.DBMS_DATA_MINING.export_model procedure
.Native import is also supported in all scenarios. Using the dump file set produced by the Oracle Data Pump Export Utility (expdp)
, an ODM user can run the Oracle Data Pump Import Utility (impdp)
to import all data mining models contained in the dump file set.
ODM users can import a specific model from the dump file set using DBMS_DATA_MINING.import_model
procedure.
For more information about native export and import, see the Oracle 10g Data Mining Administrator's Guide and the DBMS_DATA_MINING chapter in the PL/SQL Packages and Types Reference.
Suppose that the application builds a model on one system named BLDSYS and applies the model on a different system named SCORESYS.
BLDYS must have ODM installed. ODM supports all data mining activities (building models, testing models, applying models, etc.). SCORESYS can have the ODM Scoring Engine installed. It will not be possible to build models on SCORESYS, but it will be possible to apply the model. Note that SCORESYS could have a full ODM product installation.
The following processing takes place: