Reference data items (dictionaries, synonym catalogs and string patterns) are defined in the configuration file within the <reference-data-catalog> element. Some examples follow:
Dictionaries are stored within the <dictionaries> element within the reference data section. Three types of dictionaries can be added.
Datastore dictionaries
<reference-data-catalog>
  <dictionaries>
    ...
    <datastore-dictionary name="Lastnames" description="My datastore based dictionary">
      <datastore-name>orderdb</datastore-name>
      <column-path>EMPLOYEES.LASTNAME</column-path>
    </datastore-dictionary>
    ...
  </dictionaries>
</reference-data-catalog>
Text file dictionaries
<reference-data-catalog>
  <dictionaries>
    ...
    <text-file-dictionary name="Firstnames" description="My file based dictionary">
      <filename>/path/to/first.txt</filename>
      <encoding>UTF-8</encoding>
    </text-file-dictionary>
    ...
  </dictionaries>
</reference-data-catalog>
Value list dictionaries
<reference-data-catalog>
  <dictionaries>
    ...
    <value-list-dictionary name="Greetings" description="My simple value list">
      <value>hello</value>
      <value>hi</value>
      <value>greetings</value>
      <value>godday</value>
    </value-list-dictionary>
    ...
  </dictionaries>
</reference-data-catalog>
Synonym catalogs are stored within the <synonym-catalogs> element within the reference data section. Two types of synonym catalogs can be added.
Text file synonym catalogs
<reference-data-catalog>
  <synonym-catalogs>
    ...
    <text-file-synonym-catalog name="textfile_syn" description="My text file synonyms">
      <filename>/path/to/synonyms.txt</filename>
      <encoding>UTF-8</encoding>
      <case-sensitive>false</case-sensitive>
    </text-file-synonym-catalog>
    ...
  </synonym-catalogs>
</reference-data-catalog>
Datastore synonym catalogs
<reference-data-catalog>
  <synonym-catalogs>
    ...
    <datastore-synonym-catalog name="datastore_syn" description="My datastore synonyms">
      <datastore-name>orderdb</datastore-name>
      <master-term-column-path>CUSTOMERS.CUSTOMERNAME</master-term-column-path>
      <synonym-column-path>CUSTOMERS.CUSTOMERNUMBER</synonym-column-path>
      <synonym-column-path>CUSTOMERS.PHONE</synonym-column-path>
    </datastore-synonym-catalog>
    ...
  </synonym-catalogs>
</reference-data-catalog>
String patterns are stored within the <string-patterns> element within the reference data section. Two types of string patterns can be added.
Regular expression (regex) string patterns
<reference-data-catalog>
  <string-patterns>
    ...
    <regex-pattern name="regex danish email" description="Danish email addresses">
      <expression>[a-z]+@[a-z]+\.dk</expression>
      <match-entire-string>true</match-entire-string>
    </regex-pattern>
    ...
  </string-patterns>
</reference-data-catalog>
Simple string patterns
<reference-data-catalog>
  <string-patterns>
    ...
    <simple-pattern name="simple email" description="Simple email pattern">
      <expression>aaaa@aaaaa.aa</expression>
    </simple-pattern>
    ...
  </string-patterns>
</reference-data-catalog>
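Combined example
Each snippet in this section shows one kind of reference data on its own. Since all three kinds are defined within the same <reference-data-catalog> element, a complete section might look roughly like the sketch below. It reuses names and values from the examples in this section. The order of the three child elements is an assumption and should be checked against the configuration schema.
<reference-data-catalog>
  <!-- Sketch only: the order of these three sections is assumed, not confirmed -->
  <dictionaries>
    <value-list-dictionary name="Greetings" description="My simple value list">
      <value>hello</value>
      <value>hi</value>
    </value-list-dictionary>
  </dictionaries>
  <synonym-catalogs>
    <text-file-synonym-catalog name="textfile_syn" description="My text file synonyms">
      <filename>/path/to/synonyms.txt</filename>
      <encoding>UTF-8</encoding>
      <case-sensitive>false</case-sensitive>
    </text-file-synonym-catalog>
  </synonym-catalogs>
  <string-patterns>
    <simple-pattern name="simple email" description="Simple email pattern">
      <expression>aaaa@aaaaa.aa</expression>
    </simple-pattern>
  </string-patterns>
</reference-data-catalog>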