HiSoftware™
TagGen Server - Crawler Settings
The HiSoftware TagGen Server Administrator allows you to configure
crawler behavior for Individual Resources and Managed Groups. All
Inheritance Rules apply to the crawler.
The crawler has three sets of server component settings that
can be controlled:
- Metadata Rules
- Policy Rules
- HTML Keyword Generation Rules
| Function | Description | Benefit |
|---|---|---|
| Metadata Rules | | |
| Full Replace (Overwrite Existing Values) | Instructs the crawler to replace any metadata values that it finds during a crawl. | Allows enterprise-wide realignment to metadata policy without having to revisit individual documents that may contain bad, previously entered metadata values. |
| Append (Add to existing value if not the same) | Instructs the crawler to append the metadata value it has been told to add to the existing value, unless the two are the same. | Allows a planned coexistence of automated and manual systems that update the same tags. |
| Ignore (Do nothing if the Tag exists and has a value) | Instructs the crawler to do nothing if it finds the metadata name and it already has a value. | Allows metadata inheritance rules to apply down to the individual document level. |
| Use "Z" Tokens | Instructs the crawler to apply Z Tokens to the HTML metadata that it updates. | Provides interoperability and GILS compatibility for sites that require it. |
| Policy Rules | | |
| Log all Failed URLs/Files | Instructs the crawler to log every file that fails the policy check, for administrative review. | Provides a flexible method for managing policy. |
| Case Sensitive Match | Instructs the crawler to take letter case into account when checking values. | Allows case-exact matching of values when comparing policy against actual metadata. |
| Exact Match | Instructs the crawler to validate both the case of a value and its entire contents. | Allows matching on both case and entire contents when comparing policy against actual metadata. |
| Keyword Generation | | |
| Automatically Generate HTML Keywords | If selected, instructs the crawler to generate keywords. | Allows for document-level discovery personalization. |
| Find the Top # Words | Tells the crawler how many words to find. | Gives the administrator flexibility in defining the exact number of keywords to return. |
| Edit Noise Words | Allows the administrator to define a list of words that the crawler should ignore. | Filters out common words that have no meaning for discovery. |
| # of Characters | Tells the crawler how many characters past the </HEAD> tag to read when building its list of keywords. | Provides the best and most respected method for returning discovery-level keywords. |
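
The three metadata write modes in the table (Full Replace, Append, Ignore) can be illustrated with a minimal Python sketch. It is not TagGen code: `WriteMode`, `apply_rule`, and the `separator` parameter are hypothetical names chosen for illustration. Two behaviors are assumptions rather than documented facts: that every mode writes the value when the tag is missing or empty, and that "not the same" means the whole existing value differs from the new one.

```python
from enum import Enum


class WriteMode(Enum):
    """Hypothetical names for the three metadata write modes."""
    FULL_REPLACE = "full_replace"
    APPEND = "append"
    IGNORE = "ignore"


def apply_rule(existing, name, new_value, mode, separator=", "):
    """Return a copy of `existing` with one metadata rule applied.

    `existing` maps metadata names to their current string values.
    """
    updated = dict(existing)
    current = updated.get(name, "")

    if not current:
        # Assumption: a missing or empty tag is always filled in,
        # whichever mode is selected.
        updated[name] = new_value
    elif mode is WriteMode.FULL_REPLACE:
        # Overwrite whatever value was there before.
        updated[name] = new_value
    elif mode is WriteMode.APPEND:
        # Add the new value only if it differs from the existing one.
        if current != new_value:
            updated[name] = current + separator + new_value
    # WriteMode.IGNORE: the tag exists and has a value, so do nothing.
    return updated
```

For example, with an existing `keywords` value of `alpha`, applying `beta` gives `beta` under Full Replace, `alpha, beta` under Append, and leaves `alpha` untouched under Ignore.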
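
The keyword generation settings (Find the Top # Words, Edit Noise Words, # of Characters) can be pictured with the following rough Python sketch. The function name `generate_keywords` and its parameters are hypothetical. Ranking words by frequency and splitting on alphabetic runs are assumptions, since the page does not say how the crawler scores or tokenizes words.

```python
import re
from collections import Counter


def generate_keywords(html, top_n=10, noise_words=(), max_chars=2000):
    """Return up to `top_n` candidate keywords from an HTML string.

    Only the first `max_chars` characters after the closing </HEAD>
    tag are examined; words listed in `noise_words` are skipped.
    """
    # Locate the end of the head section (tag case does not matter).
    match = re.search(r"</head\s*>", html, flags=re.IGNORECASE)
    start = match.end() if match else 0
    window = html[start:start + max_chars]

    # Strip markup, including a tag cut off at the end of the window.
    text = re.sub(r"<[^>]*(?:>|\Z)", " ", window)

    # Assumption: a "word" is a run of ASCII letters, compared in lowercase.
    words = re.findall(r"[a-z]+", text.lower())
    noise = {w.lower() for w in noise_words}

    # Assumption: the "top" words are simply the most frequent ones.
    counts = Counter(w for w in words if w not in noise)
    return [word for word, _ in counts.most_common(top_n)]
```

A call such as `generate_keywords(page, top_n=5, noise_words=["the", "and"], max_chars=1000)` would return at most five words drawn from the first 1000 characters following `</HEAD>`.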