Use the Sampling Options tab in the Distributed Sampler to specify your sampling criteria.
To export your sampling configuration, you need to provide the following information:
To export your configuration, click Export. After you have successfully exported from the Distributed Sampler, the following files are created in the specified output directory:
Note: If your Knowledge Base and the data stores you want to sample are located on the same Windows machine that is running Data Express, you do not need to export configuration information, execute sampling from a script, or load results. Instead, you would bypass those steps and click the Start button in the Distributed Sampler to execute sampling.
To further control your sampling results, you can specify advanced sampling options, which are accessible from the Advanced button.
You can restrict which data elements are sampled by their data element types:
By default, all types are checked. Uncheck a type to remove it as a sampling candidate.
You can restrict which data elements are sampled by their data element sizes:
By default, all values are 0 (zero), which means to include all data elements regardless of the associated size values.
If a minimum value is set, the data element size to be included is the value greater than the specified MinValue. For example, if the Min length is changed to 1, data elements with a length of 2 or greater are included.
Likewise, if a maximum value is set, the data element size to be included is the value less than the specified MaxValue. For example, if the Max length is changed to 99, data elements with a length of 98 or less are included.
You can restrict the Min/Max calculation sampling for data elements:
The following options are available:
By default, the Out-of-range minimum value is 0 (zero), which means to include all data elements regardless of the actual value.
By default, the Out-of-range maximum value is 0 (zero), which means to include all data elements regardless of the actual value.
Known Restriction: The Max. and Min. Recalculation Additional Options currently affect the minimum and maximum data element values reported for standard and compressed sampling.
You can restrict which data elements are sampled based on data element class assignments. You can also reduce the number of data elements to sample from.
You can add a class to the table show on the left after selecting it in the Select a Class list. Only data elements that are assigned to a class specified in the Selected Class list will be sampled. This is especially helpful when you have several data stores containing data elements with multiple class assignments.
The Number of records is the total number of records to be sampled. The default value is 0, which means to include all records.
Use the Select distinct values check box to improve the sampling process for big tables. The SELECT DISTINCT command is used to provide the distinct values of a column. This option allows you to create a sampling result only for the distinct values of a column. When checked, already sampled values are discarded and the result will be set for unique ones.