Configure Parquet connectivity
To preprocess tables that use Parquet, Hunk uses a preprocessor called ParquetSplitGenerator
. To use ParquetSplitGenerator
to read your Parquet tables, update your [provider]
stanza to designate ParquetSplitGenerator
and specify the path in your [virtual index]
stanza:
[provider:your-provider] vix.splunk.search.splitter = ParquetSplitGenerator [your-vix] vix.input.1.path = /user/hive/warehouse/t1/...
For best results, also require a time stamp for the virtual index:
vix.input.1.required.fields = timestamp
For more information about general virtual index settings, see About virtual indexes.
Configure Hive connectivity | Configure search head clustering |
This documentation applies to the following versions of Hunk®(Legacy): 6.1, 6.1.1, 6.1.2, 6.1.3, 6.2, 6.2.1, 6.2.2, 6.2.3, 6.2.4, 6.2.5, 6.2.6, 6.2.7, 6.2.8, 6.2.9, 6.2.10, 6.2.11, 6.2.12, 6.2.13, 6.3.0, 6.3.1, 6.3.2, 6.3.3, 6.3.4, 6.3.5, 6.3.6, 6.3.7, 6.3.8, 6.3.9, 6.3.10, 6.3.11, 6.3.12, 6.3.13, 6.4.0, 6.4.1, 6.4.2, 6.4.3, 6.4.4, 6.4.5, 6.4.6, 6.4.7, 6.4.8, 6.4.9, 6.4.10, 6.4.11
Feedback submitted, thanks!