atoti.Table.load_csv()
- Table.load_csv(path, /, *, columns={}, separator=',', encoding='utf-8', process_quotes=True, array_separator=None, date_patterns={}, client_side_encryption=None, **kwargs)
Load a CSV into this scenario.
- Parameters:
path –
The path to the CSV file to load.
.gz, .tar.gz and .zip files containing compressed CSV(s) are also supported.
The path can also be a glob pattern (e.g. path/to/directory/**.*.csv).
columns (Mapping[str, str] | Sequence[str]) –
The collection used to name, rename, or filter the CSV file columns.
If an empty collection is passed, the CSV file must have a header. The CSV column names will be used as the Table column names.
If a non-empty Mapping with str keys is passed, the CSV file must have a header and the mapping keys must be column names of the CSV file. Columns of the CSV file absent from the mapping keys will not be loaded. The mapping values will be used as the Table column names. The other parameters of this method accepting column names expect to be passed values of this mapping, not keys.
If a non-empty Sequence is passed, the CSV file must not have a header and the sequence must have as many elements as there are columns in the CSV file. The sequence elements will be used as the Table column names.
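The three cases for columns can be sketched with the standard library alone. This is only an illustration of the rules described above, not atoti's implementation: the helper name table_columns is hypothetical, and keeping the CSV column order in the Mapping case is an assumption.

```python
import csv
import io
from collections.abc import Mapping


def table_columns(csv_text, columns=()):
    """Return (table_column_names, rows) following the three documented cases.

    Hypothetical helper: it mimics the documented rules, it is not atoti code.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    if not columns:
        # Empty collection: a header is required and its names are used as-is.
        return rows[0], rows[1:]
    if isinstance(columns, Mapping):
        # Mapping: a header is required, keys select CSV columns, values rename them.
        header, data = rows[0], rows[1:]
        missing = [key for key in columns if key not in header]
        if missing:
            raise ValueError(f"not in the CSV header: {missing}")
        # Assumption: selected columns keep the order they have in the CSV file.
        kept = [i for i, name in enumerate(header) if name in columns]
        names = [columns[header[i]] for i in kept]
        return names, [[row[i] for i in kept] for row in data]
    # Sequence: no header, and exactly one name per CSV column.
    if rows and len(columns) != len(rows[0]):
        raise ValueError("the sequence length must match the CSV column count")
    return list(columns), rows
```

With a header of id,qty,note, passing {"id": "ID", "qty": "Quantity"} keeps two columns under their new names and drops note, while a headerless file needs a Sequence such as ["ID", "Quantity"] with one entry per column.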
separator (str | None) –
The character separating the values of each line.
If None, the separator will be inferred in a preliminary partial read.
encoding (str) – The encoding to use to read the CSV.
process_quotes (bool | None) –
Whether double quotes should be processed to follow the official CSV specification:
- True:
  - Each field may or may not be enclosed in double quotes (however some programs, such as Microsoft Excel, do not use double quotes at all). If fields are not enclosed with double quotes, then double quotes may not appear inside the fields.
  - A double quote appearing inside a field must be escaped by preceding it with another double quote.
  - Fields containing line breaks, double quotes, and commas should be enclosed in double quotes.
- False: all double quotes within a field will be treated as any regular character, following Excel’s behavior. In this mode, it is expected that fields are not enclosed in double quotes. It is also not possible to have a line break inside a field.
- None: the behavior will be inferred in a preliminary partial read.
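The difference between the True and False modes can be approximated with Python's csv module. This is a sketch by analogy, not atoti's parser: csv.QUOTE_MINIMAL stands in for process_quotes=True, csv.QUOTE_NONE for process_quotes=False, and the helper name split_line is hypothetical.

```python
import csv
import io

# One CSV line with a quoted comma and an escaped (doubled) double quote.
LINE = '1,"Smith, John","say ""hi"""\n'


def split_line(line, process_quotes=True):
    """Split one CSV line with or without quote processing (stdlib analogy)."""
    quoting = csv.QUOTE_MINIMAL if process_quotes else csv.QUOTE_NONE
    return next(csv.reader(io.StringIO(line), quoting=quoting))


with_quotes = split_line(LINE, process_quotes=True)
# ['1', 'Smith, John', 'say "hi"']

without_quotes = split_line(LINE, process_quotes=False)
# ['1', '"Smith', ' John"', '"say ""hi"""']
```

In the second result the quotes are ordinary characters, so the comma inside the name splits the field. This is why the False mode expects fields that are not enclosed in double quotes.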
array_separator (str | None) –
The character separating array elements.
If not None, any field containing this separator will be parsed as an array.
date_patterns (Mapping[str, str]) – A column name to date pattern mapping that can be used when the built-in date parsers fail to recognize the formatted dates in the passed files.
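As a rough illustration of the array_separator rule (any field containing the separator is treated as an array), here is a minimal sketch. The helper parse_field is hypothetical, and converting elements to float is an assumption made for the example; the actual element type depends on the table column.

```python
def parse_field(field, array_separator=None):
    """Return a list when the field contains the separator, else the raw field.

    Hypothetical helper mirroring the documented rule, not atoti code.
    """
    if array_separator is not None and array_separator in field:
        # Assumption: array elements are numeric and parsed as floats.
        return [float(item) for item in field.split(array_separator)]
    return field


parse_field("1.5|2.0|-0.5", array_separator="|")  # parsed as an array
parse_field("1.5|2.0|-0.5")                       # left as a plain field
parse_field("42", array_separator="|")            # no separator, plain field
```

Because the rule applies to every field, it is safer to pick a separator character that cannot appear in the non-array columns of the file.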
client_side_encryption (ClientSideEncryptionConfig | None) – The client side encryption configuration to use when loading data.
See also read_csv() for examples.
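A minimal sketch of a call combining several of the parameters above. The file layout, the column names, and the helpers write_sample_csv and load_trades are hypothetical. The table argument is assumed to be an existing atoti Table whose columns match the mapping values, and the date pattern syntax shown is an assumption. Note that date_patterns is keyed by the Table column name (the mapping value "Date"), not by the CSV header name.

```python
import csv
import tempfile
from pathlib import Path


def write_sample_csv(directory):
    """Write a small semicolon-separated CSV with a header (hypothetical data)."""
    path = Path(directory) / "trades.csv"
    with path.open("w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f, delimiter=";")
        writer.writerow(["trade_id", "trade_date", "pnl_vector", "comment"])
        writer.writerow(["T1", "31/01/2024", "1.5|2.0|-0.5", "not loaded"])
    return path


def load_trades(table, path):
    """Load the sample file into `table`, assumed to be an atoti Table."""
    table.load_csv(
        path,
        # Keys are CSV header names, values are Table column names.
        # The "comment" column is absent from the keys, so it is not loaded.
        columns={"trade_id": "ID", "trade_date": "Date", "pnl_vector": "PnL"},
        separator=";",
        array_separator="|",
        # Keyed by the mapping value; the pattern syntax is an assumption.
        date_patterns={"Date": "dd/MM/yyyy"},
    )
```

Usage would look like load_trades(table, write_sample_csv(tempfile.mkdtemp())), given a table created beforehand with compatible column types.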