Oracle® Text Application Developer's Guide 10g Release 1 (10.1) Part Number B10729-01 |
|
|
View PDF |
This chapter describes Oracle Text administration.The following topics are covered:
While any user can create an Oracle Text index and issue a CONTAINS
query, Oracle Text provides the CTXSYS
user for administration and the CTXAPP
role for application developers.
The CTXSYS
user is created at install time. CTXSYS
can do the following:
View all indexes
Sync all indexes
Run ctxkbtc
, the knowledge base extension compiler
Query all system-defined views
Perform all the tasks of a user with the CTXAPP
role
Note: In previous releases of Oracle Text,CTXSYS had DBA privileges, and only CTXSYS could perform certain functions, such as modifying system-defined preferences or setting system parameters. See also Chapter 11, " Migrating Applications from Earlier Releases" for more on changes to CTXSYS . |
During a manual installation, after installation of the CTXSYS
schema is complete, you may want to run dr0lsys.sql
to lock and expire the CTXSYS
schema for security reasons. Alternatively, you can choose a good password for CTXSYS
when running dr0csys.sql
.
The CTXAPP
role is a system-defined role that enables users to do the following:
Create and delete Oracle Text preferences
Use the Oracle Text PL/SQL packages
Any user can create an Oracle Text index and issue a Text query. The CTXAPP
role enables users to create preferences and use the PL/SQL packages.
The system uses the standard SQL
model for granting roles to users. To grant a Text role to a user, use the GRANT
statement.
In addition, to allow application developers to call procedures in the Oracle Text PL/SQL packages, you must explicitly grant to each user EXECUTE
privileges for the Oracle Text package.
When there are inserts, updates, or deletes to documents in your base table, the DML queue stores the requests for documents waiting to be indexed. When you synchronize the index with CTX_DDL
.SYNC_INDEX
, requests are removed from this queue.
Pending DML requests can be queried with the CTX_PENDING
and CTX_USER_PENDING
views.
DML errors can be queried with the CTX_INDEX_ERRORS
or CTX_USER_INDEX_ERRORS
view.
Use the CTX_OUTPUT
PL/SQL package to log indexing and document service requests.
Use the CTX_REPORT
package to produce reports on indexes and queries. These reports can help you fine-tune or troubleshoot your applications.
The CTX_REPORT
package contains the following procedures:
These procedures create reports that describe an existing index or policy, including the settings of the index metadata, the indexing objects used, the settings of the attributes of the objects, and (for CTX_REPORT.DESCRIBE_INDEX
) index partition information, if any. These procedures are especially useful for diagnosing index-related problems.
This is sample output from DESCRIBE_INDEX
, run on a simple context index:
================================================================= INDEX DESCRIPTION================================================================= index name: "DR_TEST"."TDRBPRX0" index id: 1160 index type: context base table: "DR_TEST"."TDRBPR" primary key column: ID text column: TEXT2 text column type: VARCHAR2(80) language column: format column: charset column: ================================================================= INDEX OBJECTS ================================================================= datastore: DIRECT_DATASTORE filter: NULL_FILTER section group: NULL_SECTION_GROUP lexer: BASIC_LEXER wordlist: BASIC_WORDLIST stemmer: ENGLISH fuzzy_match: GENERIC stoplist: BASIC_STOPLIST stop_word: teststopword storage: BASIC_STORAGE r_table_clause: lob (data) store as (cache) i_index_clause: compress 2
CREATE_INDEX_SCRIPT
creates a SQL*Plus script that can create a duplicate of a given text index. Use this when you have an index but don't have the original script (if any) used to create that script and want to be able to re-create the index. For example, if you accidentally drop a script, CREATE_INDEX_SCRIPT
can re-create it; likewise, CREATE_INDEX_SCRIPT
can be useful if you have inherited indexes from another user but not the scripts that created them.
CREATE_POLICY_SCRIPT
does the same thing as CREATE_INDEX_SCRIPT
, except that it enables you to re-create a policy instead of an index.
This is sample output from CREATE_INDEX_SCRIPT
, run on a simple context index (not a complete listing):
begin ctx_ddl.create_preference('"TDRBPRX0_DST"','DIRECT_DATASTORE'); end; / ... / begin ctx_ddl.create_section_group('"TDRBPRX0_SGP"','NULL_SECTION_GROUP'); end; / ... begin ctx_ddl.create_preference('"TDRBPRX0_WDL"','BASIC_WORDLIST'); ctx_ddl.set_attribute('"TDRBPRX0_WDL"','STEMMER','ENGLISH'); ctx_ddl.set_attribute('"TDRBPRX0_WDL"','FUZZY_MATCH','GENERIC'); end; / begin ctx_ddl.create_stoplist('"TDRBPRX0_SPL"','BASIC_STOPLIST'); ctx_ddl.add_stopword('"TDRBPRX0_SPL"','teststopword'); end; / ... / begin ctx_output.start_log('TDRBPRX0_LOG'); end; / create index "DR_TEST"."TDRBPRX0" on "DR_TEST"."TDRBPR" ("TEXT2") indextype is ctxsys.context parameters(' datastore "TDRBPRX0_DST" filter "TDRBPRX0_FIL" section group "TDRBPRX0_SGP" lexer "TDRBPRX0_LEX" wordlist "TDRBPRX0_WDL" stoplist "TDRBPRX0_SPL" storage "TDRBPRX0_STO" ') /
This procedure creates a report showing the names of the internal index objects, along with their tablespaces, allocated sizes, and used sizes. It is useful for DBAs who may need to monitor the size of their indexes (for example, when disk space is at a premium).
Sample output from this procedure looks like this (partial listing):
================================================================= INDEX SIZE FOR DR_TEST.TDRBPRX10 ================================================================= TABLE: DR_TEST.DR$TDRBPRX10$I TABLESPACE NAME: DRSYS BLOCKS ALLOCATED: 4 BLOCKS USED: 1 BYTES ALLOCATED: 8,192 (8.00 KB) BYTES USED: 2,048 (2.00 KB) INDEX (LOB): DR_TEST.SYS_IL0000023161C00006$$ TABLE NAME: DR_TEST.DR$TDRBPRX10$I TABLESPACE NAME: DRSYS BLOCKS ALLOCATED: 5 BLOCKS USED: 2 BYTES ALLOCATED: 10,240 (10.00 KB) BYTES USED: 4,096 (4.00 KB) INDEX (NORMAL): DR_TEST.DR$TDRBPRX10$X TABLE NAME: DR_TEST.DR$TDRBPRX10$I TABLESPACE NAME: DRSYS BLOCKS ALLOCATED: 4 BLOCKS USED: 2 BYTES ALLOCATED: 8,192 (8.00 KB) BYTES USED: 4,096 (4.00 KB)
INDEX_STATS
produces a variety of calculated statistics about an index, such as how many documents are indexed, how many unique tokens the index contains, average size of its tokens, fragmentation information for the index, and so on. An example of a use of INDEX_STATS
might be in optimizing stoplists.
See the Oracle Text Reference for an example of the output of this procedure.
This procedure creates a report of logged queries, which you can use to perform simple analyses. With query analysis, you can find out:
which queries were made
which queries were successful
which queries were unsuccessful
how many times each query was made
You can combine these factors in various ways, such as determining the 50 most frequent unsuccessful queries made by your application.
See the Oracle Text Reference for an example of the output of this procedure.
TOKEN_INFO
is used mainly to diagnose query problems; for instance, to check that index data is not corrupted. As an example, you can use it to find out which documents are producing unexpected or bad tokens.
This is a lookup function, used mainly as input to other functions (CTX_DDL.OPTIMIZE_INDEX
, CTX_REPORT.TOKEN_INFO
, and so on).
You index documents and issue queries with standard SQL. No server is needed for performing batch DML. You can synchronize the CONTEXT
index with the CTX_DDL
.SYNC_INDEX
procedure.
See Also: For more information about indexing and index synchronization, see Chapter 3, " Indexing". |
The Oracle Text Manager is a Java application integrated with the Oracle Enterprise Manager.
The Text Manager enables administrators to create preferences, stoplists, sections, and indexes. This tool also enables administrators to perform DML.
See Also: for more information about the Oracle Text Manager, see the online help shipped with this tool. |