libcudf
23.12.00
|
Files | |
file | capitalize.hpp |
file | case.hpp |
std::unique_ptr<column> cudf::strings::capitalize | ( | strings_column_view const & | input, |
string_scalar const & | delimiters = string_scalar("", true, cudf::get_default_stream()) , |
||
rmm::cuda_stream_view | stream = cudf::get_default_stream() , |
||
rmm::mr::device_memory_resource * | mr = rmm::mr::get_current_device_resource() |
||
) |
Returns a column of capitalized strings.
If the delimiters
is an empty string, then only the first character of each row is capitalized. Otherwise, a non-delimiter character is capitalized after any delimiter character is found.
Any null string entries return corresponding null output column entries.
cudf::logic_error | if delimiter.is_valid() is false . |
input | String column |
delimiters | Characters for identifying words to capitalize |
stream | CUDA stream used for device memory operations and kernel launches |
mr | Device memory resource used to allocate the returned column's device memory |
std::unique_ptr<column> cudf::strings::is_title | ( | strings_column_view const & | input, |
rmm::cuda_stream_view | stream = cudf::get_default_stream() , |
||
rmm::mr::device_memory_resource * | mr = rmm::mr::get_current_device_resource() |
||
) |
Checks if the strings in the input column are title formatted.
The first character of each word should be upper-case while all other characters should be lower-case. A word is a sequence of upper-case and lower-case characters.
This function returns a column of booleans indicating true if the string in the input row is in title format and false if not.
Any null string entries result in corresponding null output column entries.
input | String column |
stream | CUDA stream used for device memory operations and kernel launches |
mr | Device memory resource used to allocate the returned column's device memory |
std::unique_ptr<column> cudf::strings::swapcase | ( | strings_column_view const & | strings, |
rmm::cuda_stream_view | stream = cudf::get_default_stream() , |
||
rmm::mr::device_memory_resource * | mr = rmm::mr::get_current_device_resource() |
||
) |
Returns a column of strings converting lower case characters to upper case and vice versa.
Only upper or lower case alphabetical characters are converted. All other characters are copied. Case conversion may result in strings that are longer or shorter than the original string in bytes.
Any null entries create null entries in the output column.
strings | Strings instance for this operation. |
stream | CUDA stream used for device memory operations and kernel launches |
mr | Device memory resource used to allocate the returned column's device memory. |
std::unique_ptr<column> cudf::strings::title | ( | strings_column_view const & | input, |
string_character_types | sequence_type = string_character_types::ALPHA , |
||
rmm::cuda_stream_view | stream = cudf::get_default_stream() , |
||
rmm::mr::device_memory_resource * | mr = rmm::mr::get_current_device_resource() |
||
) |
Modifies first character of each word to upper-case and lower-cases the rest.
A word here is a sequence of characters of sequence_type
delimited by any characters not part of the sequence_type
character set.
This function returns a column of strings where, for each string row in the input, the first character of each word is converted to upper-case, while all the remaining characters in a word are converted to lower-case.
Any null string entries return corresponding null output column entries.
input | String column |
sequence_type | The character type that is used when identifying words |
stream | CUDA stream used for device memory operations and kernel launches |
mr | Device memory resource used to allocate the returned column's device memory |
std::unique_ptr<column> cudf::strings::to_lower | ( | strings_column_view const & | strings, |
rmm::cuda_stream_view | stream = cudf::get_default_stream() , |
||
rmm::mr::device_memory_resource * | mr = rmm::mr::get_current_device_resource() |
||
) |
Converts a column of strings to lower case.
Only upper case alphabetical characters are converted. All other characters are copied. Case conversion may result in strings that are longer or shorter than the original string in bytes.
Any null entries create null entries in the output column.
strings | Strings instance for this operation. |
stream | CUDA stream used for device memory operations and kernel launches |
mr | Device memory resource used to allocate the returned column's device memory. |
std::unique_ptr<column> cudf::strings::to_upper | ( | strings_column_view const & | strings, |
rmm::cuda_stream_view | stream = cudf::get_default_stream() , |
||
rmm::mr::device_memory_resource * | mr = rmm::mr::get_current_device_resource() |
||
) |
Converts a column of strings to upper case.
Only lower case alphabetical characters are converted. All other characters are copied. Case conversion may result in strings that are longer or shorter than the original string in bytes.
Any null entries create null entries in the output column.
strings | Strings instance for this operation. |
stream | CUDA stream used for device memory operations and kernel launches |
mr | Device memory resource used to allocate the returned column's device memory. |