Reference data

Reference data items (dictionaries, synonym catalogs and string patterns) are defined in the configuration file in the element <reference-data-catalog>. Below some examples:

Dictionaries

Dictionaries are stored within the <dictionaries> element within the reference data section. Three types of dictionaries can be added.

Datastore dictionaries

				<reference-data-catalog>
				  <dictionaries>
				   ...
				    <datastore-dictionary name="Lastnames" description="My datastore based dictionary">
				      <datastore-name>orderdb</datastore-name>
				      <column-path>EMPLOYEES.LASTNAME</column-path>
				    </datastore-dictionary>
				   ...
				  </dictionaries>
				</reference-data-catalog>
			

Text file dictionaries

				<reference-data-catalog>
				  <dictionaries>
				   ...
				    <text-file-dictionary name="Firstnames" description="My file based dictionary">
				      <filename>/path/to/first.txt</filename>
				      <encoding>UTF-8</encoding>
				    </text-file-dictionary>
				   ...
				  </dictionaries>
				</reference-data-catalog>
			

Value list dictionaries

				<reference-data-catalog>
				  <dictionaries>
				   ...
				    <value-list-dictionary name="Greetings" description="My simple value list">
				      <value>hello</value>
				      <value>hi</value>	
				      <value>greetings</value>
				    <value>godday</value>
				    </value-list-dictionary>
				   ...
				  </dictionaries>
				</reference-data-catalog>
			

Synonym catalogs

Synonym catalogs are stored within the <synonym-catalogs> element within the reference data section. Two types of dictionaries can be added.

Text file synonym catalogs

				<reference-data-catalog>
				  <synonym-catalogs>
				   ...
				    <text-file-synonym-catalog name="textfile_syn" description="My text file synonyms">
				      <filename>/path/to/synonyms.txt</filename>
				      <encoding>UTF-8</encoding>
				      <case-sensitive>false</case-sensitive>
				    </text-file-synonym-catalog>
				   ...
				  </synonym-catalogs>
				</reference-data-catalog>
			

Datastore synonym catalogs

				<reference-data-catalog>
				  <synonym-catalogs>
				   ...
				    <datastore-synonym-catalog name="datastore_syn" description="My datastore synonyms">
				      <datastore-name>orderdb</datastore-name>
				      <master-term-column-path>CUSTOMERS.CUSTOMERNAME</master-term-column-path>
				      <synonym-column-path>CUSTOMERS.CUSTOMERNUMBER</synonym-column-path>
				      <synonym-column-path>CUSTOMERS.PHONE</synonym-column-path>
				    </datastore-synonym-catalog>
				   ...
				  </synonym-catalogs>
				</reference-data-catalog>
			

String patterns

Dictionaries are stored within the <string-patterns> element within the reference data section. Two types of string patterns can be added.

Regular expression (regex) string patterns

				<reference-data-catalog>
				  <string-patterns>
				   ...
				    <regex-pattern name="regex danish email" description="Danish email addresses">
				      <expression>[a-z]+@[a-z]+\.dk</expression>
				      <match-entire-string>true</match-entire-string>
				    </regex-pattern>
				   ...
				  </string-patterns>
				</reference-data-catalog>
			

Simple string patterns

				<reference-data-catalog>
				  <string-patterns>
				   ...
				    <simple-pattern name="simple email" description="Simple email pattern">
				      <expression>aaaa@aaaaa.aa</expression>
				    </simple-pattern>
				   ...
				  </string-patterns>
				</reference-data-catalog>