File Information

Descriptions for fields and options in the File Information section of the Advanced Uploader tab.

The tables in this topic explain the fields and options available in the File Information section of the Advanced Uploader tab.

Table 1. File Information fields and options
Field Description
File Type The format of data in the file (table).
Fixed
Fields (columns) in the file have a fixed length. For example, in each record (row) of the table, the first column is always 10 characters, the second is three characters, and the third is 20 characters.
Delimited
Fields (columns) in the file may be different lengths; a character is used to separate each field in the record (row) of the table. One of the most common file formats is CSV (comma-separated values). For example, each field in a CSV delimited file is separated by a comma ( , ) character.
Ignore Nulls Determines how 1010data treats a null character in an upload file.
Yes
1010data ignores the null character and continues to upload the data included after the null character. An N/A is inserted in place of the null.
No
Unless masked, 1010data will interpret a null character as the end of the file. Any data in the file after the null character will not be included in the upload.
Record Delimiter The delimiter used to indicate the end of a each record (row) in the file (table). A delimiter is a sequence of one or more characters used to specify the boundary between separate, independent regions in plain text files. In general, the two most commonly used record delimiters in 1010data are CRLF and LF.
CRLF
The Carriage Return Line Feed (CRLF) ('\r\n', 0x0D 0x0A) record delimiter is used in Microsoft Windows, DOS (MS-DOS, PC DOS, etc.), DEC TOPS-10, RT-11, CP/M, MP/M, Atari TOS, OS/2, Symbian OS, Palm OS, Amstrad CPC, and most other early non-Unix and non-IBM OSes. Select this option for PC files.
NL
The New Line (NL) (0x15) record delimiter is used in EBCDIC systems—mainly IBM mainframe systems, including z/OS (OS/390) and i5/OS (OS/400).
LF
The Line Feed (LF) ('\n', 0x0A) record delimiter is used in Multics, Unix and Unix-like systems (Linux, OS X, FreeBSD, AIX, Xenix, etc.), BeOS, Amiga, RISC OS, and others. Select this option for Mac OS files.
CRNL
The Carriage Return New Line (CRNL) (0x0D 0x15) record delimiter is used for files delimited by the EBCDIC carriage return character followed by the EBCDIC new line character. The CRNL record delimiter is rarely used.
None
The last column of each row does not have a delimiter. This option is used for fixed-width files.
Field Delimiter The character used to separate each field (column) in the record (row) of the file (table). This field displays when Delimited is selected from the Type drop-down list.
Comma
A comma ( , ) is used to separate the columns in each row of the table.
Tab
A tab stop is used to separate the columns in each row of the table.
Pipe
A pipe ( | ) is used to separate the columns in each row of the table.
Other
A character other than a comma, tab, or pipe is used to separate the columns in each row of the table.
When Other is selected, 1010data displays the Other field next to the Field Delimiter drop-down list. You must enter the character that is used to separate the columns in each row of the table.
Note: Only ASCII special characters may be used as a field delimiter.
Other Delimiter The character other than a comma, tab, or pipe that is used to separate each field (column) in the record (row) of the file (table).

This field only displays if Other Delimiter is selected in the Field Delimiter drop-down list.

# of Records to Skip The number of records (rows) to skip when uploading the file (table).

For example, the first row of a table usually contains names and not data. To exclude the first row of the table in the upload, enter 1. When the file is uploaded, the Insights Platform will ignore the first row in the table and start uploading the data beginning with row 2.

Load The total number of records (rows) in the file (table) to upload.

For example, to include only the first 100 rows of the table in the upload, enter 100. 1010data will upload the first 100 rows in the table and then ignore any rows that follow.

Advanced Options

The advanced options provide additional settings for input file handling and data structuring in 1010data.

To view these additional fields, click Advanced Options.

Table 2. Advanced Options fields
Field Description
Masking character The character used to encapsulate fields that may contain the delimiting character. By default, 1010data uses the quotation mark ( " ) character.

For example, a field of text data may also contain a comma—a common field delimiter. To prevent 1010data from splitting the data field at the comma, that field needs to be surrounded by the indicated masking character.

Note: If this field is left blank, 1010data uses the quotation mark ( " ) for the masking character. To set the masking character to none, enter \0.
Masking Width The maximum number of characters within a masked field that 1010data will consider when identifying field delimiter characters to ignore. The default maximum masking width is 1000 characters.

For example, if a masked field is 2000 characters in length, but the maximum masking width is set at 1000, only the field delimiter characters that occur within the first 1000 characters of the field are ignored. Any field delimiter characters after the first 1000 characters will be treated as a standard field delimiter.

Rows Per Seg The maximum number of records (rows) in the file (table) to include in a single segment. The default is 5 million rows and, in general, should not be adjusted.

However, if the uploaded table will be used for aggressive expanding purposes, the maximum number of rows in each segment may need to be reduced. Contact 1010data Support to discuss your needs and we will help you determine the appropriate setting.