libcudf
24.04.00
|
Input arguments to the read_json
interface.
More...
#include <json.hpp>
Public Member Functions | |
json_reader_options ()=default | |
Default constructor. More... | |
source_info const & | get_source () const |
Returns source info. More... | |
std::variant< std::vector< data_type >, std::map< std::string, data_type >, std::map< std::string, schema_element > > const & | get_dtypes () const |
Returns data types of the columns. More... | |
compression_type | get_compression () const |
Returns compression format of the source. More... | |
size_t | get_byte_range_offset () const |
Returns number of bytes to skip from source start. More... | |
size_t | get_byte_range_size () const |
Returns number of bytes to read. More... | |
size_t | get_byte_range_size_with_padding () const |
Returns number of bytes to read with padding. More... | |
size_t | get_byte_range_padding () const |
Returns number of bytes to pad when reading. More... | |
bool | is_enabled_lines () const |
Whether to read the file as a json object per line. More... | |
bool | is_enabled_mixed_types_as_string () const |
Whether to parse mixed types as a string column. More... | |
bool | is_enabled_dayfirst () const |
Whether to parse dates as DD/MM versus MM/DD. More... | |
bool | is_enabled_legacy () const |
Whether the legacy reader should be used. More... | |
bool | is_enabled_keep_quotes () const |
Whether the reader should keep quotes of string values. More... | |
bool | is_enabled_normalize_single_quotes () const |
Whether the reader should normalize single quotes around strings. More... | |
bool | is_enabled_normalize_whitespace () const |
Whether the reader should normalize unquoted whitespace characters. More... | |
json_recovery_mode_t | recovery_mode () const |
Queries the JSON reader's behavior on invalid JSON lines. More... | |
void | set_dtypes (std::vector< data_type > types) |
Set data types for columns to be read. More... | |
void | set_dtypes (std::map< std::string, data_type > types) |
Set data types for columns to be read. More... | |
void | set_dtypes (std::map< std::string, schema_element > types) |
Set data types for a potentially nested column hierarchy. More... | |
void | set_compression (compression_type comp_type) |
Set the compression type. More... | |
void | set_byte_range_offset (size_type offset) |
Set number of bytes to skip from source start. More... | |
void | set_byte_range_size (size_type size) |
Set number of bytes to read. More... | |
void | enable_lines (bool val) |
Set whether to read the file as a json object per line. More... | |
void | enable_mixed_types_as_string (bool val) |
Set whether to parse mixed types as a string column. Also enables forcing to read a struct as string column using schema. More... | |
void | enable_dayfirst (bool val) |
Set whether to parse dates as DD/MM versus MM/DD. More... | |
void | enable_legacy (bool val) |
Set whether to use the legacy reader. More... | |
void | enable_keep_quotes (bool val) |
Set whether the reader should keep quotes of string values. More... | |
void | enable_normalize_single_quotes (bool val) |
Set whether the reader should enable normalization of single quotes around strings. More... | |
void | enable_normalize_whitespace (bool val) |
Set whether the reader should enable normalization of unquoted whitespace. More... | |
void | set_recovery_mode (json_recovery_mode_t val) |
Specifies the JSON reader's behavior on invalid JSON lines. More... | |
Static Public Member Functions | |
static json_reader_options_builder | builder (source_info src) |
create json_reader_options_builder which will build json_reader_options. More... | |
Input arguments to the read_json
interface.
Available parameters are closely patterned after PANDAS' read_json
API. Not all parameters are supported. If the matching PANDAS' parameter has a default value of None
, then a default value of -1
or 0
may be used as the equivalent.
Parameters in PANDAS that are unavailable or in cudf:
Name | Description |
---|---|
orient | currently fixed-format |
typ | data is always returned as a cudf::table |
convert_axes | use column functions for axes operations instead |
convert_dates | dates are detected automatically |
keep_default_dates | dates are detected automatically |
numpy | data is always returned as a cudf::table |
precise_float | there is only one converter |
date_unit | only millisecond units are supported |
encoding | only ASCII-encoded data is supported |
chunksize | use byte_range_xxx for chunking instead |
Definition at line 88 of file io/json.hpp.
|
default |
Default constructor.
This has been added since Cython requires a default constructor to create objects on stack.
|
static |
create json_reader_options_builder which will build json_reader_options.
src | source information used to read json file |
|
inline |
Set whether to parse dates as DD/MM versus MM/DD.
val | Boolean value to enable/disable day first parsing format |
Definition at line 347 of file io/json.hpp.
|
inline |
Set whether the reader should keep quotes of string values.
val | Boolean value to indicate whether the reader should keep quotes of string values |
Definition at line 362 of file io/json.hpp.
|
inline |
Set whether to use the legacy reader.
val | Boolean value to enable/disable the legacy reader |
Definition at line 354 of file io/json.hpp.
|
inline |
Set whether to read the file as a json object per line.
val | Boolean value to enable/disable the option to read each line as a json object |
Definition at line 332 of file io/json.hpp.
|
inline |
Set whether to parse mixed types as a string column. Also enables forcing to read a struct as string column using schema.
val | Boolean value to enable/disable parsing mixed types as a string column |
Definition at line 340 of file io/json.hpp.
|
inline |
Set whether the reader should enable normalization of single quotes around strings.
val | Boolean value to indicate whether the reader should normalize single quotes around strings |
Definition at line 370 of file io/json.hpp.
|
inline |
Set whether the reader should enable normalization of unquoted whitespace.
val | Boolean value to indicate whether the reader should normalize unquoted whitespace characters i.e. tabs and spaces |
Definition at line 378 of file io/json.hpp.
|
inline |
Returns number of bytes to skip from source start.
Definition at line 184 of file io/json.hpp.
|
inline |
Returns number of bytes to pad when reading.
Definition at line 212 of file io/json.hpp.
|
inline |
Returns number of bytes to read.
Definition at line 191 of file io/json.hpp.
|
inline |
Returns number of bytes to read with padding.
Definition at line 198 of file io/json.hpp.
|
inline |
Returns compression format of the source.
Definition at line 177 of file io/json.hpp.
|
inline |
Returns data types of the columns.
Definition at line 167 of file io/json.hpp.
|
inline |
|
inline |
Whether to parse dates as DD/MM versus MM/DD.
Definition at line 248 of file io/json.hpp.
|
inline |
Whether the reader should keep quotes of string values.
Definition at line 262 of file io/json.hpp.
|
inline |
Whether the legacy reader should be used.
Definition at line 255 of file io/json.hpp.
|
inline |
Whether to read the file as a json object per line.
true
if reading the file as a json object per line Definition at line 234 of file io/json.hpp.
|
inline |
Whether to parse mixed types as a string column.
true
if mixed types are parsed as a string column Definition at line 241 of file io/json.hpp.
|
inline |
Whether the reader should normalize single quotes around strings.
Definition at line 269 of file io/json.hpp.
|
inline |
Whether the reader should normalize unquoted whitespace characters.
Definition at line 276 of file io/json.hpp.
|
inline |
Queries the JSON reader's behavior on invalid JSON lines.
Definition at line 283 of file io/json.hpp.
|
inline |
Set number of bytes to skip from source start.
offset | Number of bytes of offset |
Definition at line 318 of file io/json.hpp.
|
inline |
Set number of bytes to read.
size | Number of bytes to read |
Definition at line 325 of file io/json.hpp.
|
inline |
Set the compression type.
comp_type | The compression type used |
Definition at line 311 of file io/json.hpp.
|
inline |
Set data types for columns to be read.
types | Vector dtypes in string format |
Definition at line 297 of file io/json.hpp.
|
inline |
Set data types for a potentially nested column hierarchy.
types | Map of column names to schema_element to support arbitrary nesting of data types |
Definition at line 304 of file io/json.hpp.
|
inline |
Set data types for columns to be read.
types | Vector of dtypes |
Definition at line 290 of file io/json.hpp.
|
inline |
Specifies the JSON reader's behavior on invalid JSON lines.
val | An enum value to indicate the JSON reader's behavior on invalid JSON lines. |
Definition at line 385 of file io/json.hpp.