The FilterCode..::..STAT_CHUNK type exposes the following members.
Fields
Name | Description | |
---|---|---|
attribute |
The property to be applied to the chunk. If a filter requires that the same text
have more than one property, it needs to emit the text once for each property
in separate chunks.
| |
breakType |
The type of break that separates the previous chunk from the current
chunk. Values are from the CHUNK_BREAKTYPE enumeration.
| |
cwcLenSource |
The length in characters of the source text from which the current
chunk was derived.
A zero value signifies character-by-character correspondence between
the source text and
the derived text. A nonzero value means that no such direct
correspondence exists
| |
cwcStartSource |
The offset from which the source text for a derived chunk starts in
the source chunk.
| |
flags |
Flags indicate whether this chunk contains a text-type or a
value-type property.
Flag values are taken from the CHUNKSTATE enumeration. If the CHUNK_TEXT flag is set,
IFilter::GetText should be used to retrieve the contents of the chunk
as a series of words.
If the CHUNK_VALUE flag is set, IFilter::GetValue should be used to retrieve
the value and treat it as a single property value. If the filter dictates that the same
content be treated as both text and as a value, the chunk should be emitted twice in two
different chunks, each with one flag set.
| |
idChunk |
The chunk identifier. Chunk identifiers must be unique for the
current instance of the IFilter interface.
Chunk identifiers must be in ascending order. The order in which
chunks are numbered should correspond to the order in which they appear
in the source document. Some search engines can take advantage of the
proximity of chunks of various properties. If so, the order in which
chunks with different properties are emitted will be important to the
search engine.
| |
idChunkSource |
The ID of the source of a chunk. The value of the idChunkSource member depends on the nature of the chunk:
If the chunk is a text-type property, the value of the idChunkSource member must be the same as the value of the idChunk member.
If the chunk is an public value-type property derived from textual content, the value of the idChunkSource member is the chunk ID for the
text-type chunk from which it is derived.
If the filter attributes specify to return only public value-type
properties, there is no content chunk from which to derive the current
public value-type property. In this case, the value of the
idChunkSource member must be set to zero, which is an invalid chunk.
| |
locale |
The language and sublanguage associated with a chunk of text. Chunk locale is used
by document indexers to perform proper word breaking of text. If the chunk is
neither text-type nor a value-type with data type VT_LPWSTR, VT_LPSTR or VT_BSTR,
this field is ignored.
|