Turi Create  4.0
ML Data

Classes

class  turi::ml_data_internal::column_indexer
 
class  turi::ml_data_internal::column_statistics
 
class  turi::ml_metadata
 
class  turi::ml_data
 
struct  turi::ml_data_full_entry
 
class  turi::ml_data_iterator
 
class  turi::ml_data_row_reference
 

Variables

static constexpr int turi::ml_data_internal::_column_indexer_first_level_lookup_size_n_bits = 8
 
enum  turi::ml_missing_value_action : int
 
enum  turi::ml_column_mode : int
 
static GL_HOT_INLINE_FLATTEN bool turi::mode_is_categorical (ml_column_mode mode)
 
static GL_HOT_INLINE_FLATTEN bool turi::mode_has_fixed_size (ml_column_mode mode)
 
static GL_HOT_INLINE_FLATTEN bool turi::mode_is_indexed (ml_column_mode mode)
 
const char * turi::column_mode_enum_to_name (ml_column_mode mode)
 

Detailed Description

Data normalization. See ml_data for details.

Enumeration Type Documentation

◆ ml_column_mode

enum class turi::ml_column_mode : int

The main mode of each entry value. The mode determines how the value is stored, how it is translated, and which functionality it works with.

Definition at line 32 of file ml_data_column_modes.hpp.

◆ ml_missing_value_action

enum class turi::ml_missing_value_action : int

The action to take when a missing value is encountered.

Definition at line 27 of file ml_data_column_modes.hpp.

Function Documentation

◆ column_mode_enum_to_name()

const char* turi::column_mode_enum_to_name ( ml_column_mode  mode)

Returns the name of the given column mode. Intended for error reporting.

◆ mode_has_fixed_size()

static GL_HOT_INLINE_FLATTEN bool turi::mode_has_fixed_size ( ml_column_mode  mode)
inline, static

Returns true if the underlying type always results in a constant-size pattern, and false otherwise.

Definition at line 66 of file ml_data_column_modes.hpp.

◆ mode_is_categorical()

static GL_HOT_INLINE_FLATTEN bool turi::mode_is_categorical ( ml_column_mode  mode)
inline, static

Returns true if the underlying type is treated as a categorical variable, and false otherwise.

Definition at line 47 of file ml_data_column_modes.hpp.

◆ mode_is_indexed()

static GL_HOT_INLINE_FLATTEN bool turi::mode_is_indexed ( ml_column_mode  mode)
inline, static

Returns true if the underlying type is indexed, and false otherwise. This differs from mode_is_categorical() in that dictionaries are indexed but are not treated as pure categorical variables, because they have values associated with their keys.

Definition at line 87 of file ml_data_column_modes.hpp.
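
The difference between "indexed" and "categorical" described above can be sketched as follows. This is only an illustration, not the code in ml_data_column_modes.hpp: the demo namespace, the three-value column_mode enum, and both function bodies are assumptions made up for this example, and the real ml_column_mode may define other values.

```cpp
namespace demo {

// Hypothetical, reduced set of column modes (assumed for illustration only).
enum class column_mode : int { NUMERIC, CATEGORICAL, DICTIONARY };

// A pure categorical column: each entry is only a category.
constexpr bool is_categorical(column_mode m) {
  return m == column_mode::CATEGORICAL;
}

// An indexed column: entries (or their keys) are mapped to integer indices.
// Dictionaries qualify because their keys are indexed, even though each key
// also carries a value and so is not a pure categorical variable.
constexpr bool is_indexed(column_mode m) {
  return m == column_mode::CATEGORICAL || m == column_mode::DICTIONARY;
}

}  // namespace demo
```

Under these assumptions, a dictionary column is the one case where the two predicates disagree: it is indexed but not categorical.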

Variable Documentation

◆ _column_indexer_first_level_lookup_size_n_bits

constexpr int turi::ml_data_internal::_column_indexer_first_level_lookup_size_n_bits = 8
static

Use a two-level hash table to store the index mappings. The first level has a constant size and is unlocked; the slot is selected by an n-bit hash. Each first-level slot contains its own hash table and lock, which significantly reduces lock contention. _column_indexer_first_level_lookup_size_n_bits gives the number of bits used for this first-level lookup.

Definition at line 37 of file column_indexer.hpp.
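
A two-level table of this kind might look like the sketch below. It is not the column_indexer implementation: the class TwoLevelIndex, its get_or_insert method, the string key type, and the use of std::hash are all hypothetical choices for this example. Only the idea (a fixed-size, unlocked first level selected by an 8-bit hash, with one hash table and one lock per slot) comes from the description above.

```cpp
#include <array>
#include <atomic>
#include <cstddef>
#include <functional>
#include <mutex>
#include <string>
#include <unordered_map>

// Mirrors the documented value of 8 bits; gives 2^8 = 256 first-level slots.
constexpr int kFirstLevelBits = 8;
constexpr std::size_t kNumSlots = std::size_t(1) << kFirstLevelBits;

// Hypothetical two-level index map (illustration only).
class TwoLevelIndex {
 public:
  // Returns the index assigned to key, assigning the next free index
  // the first time the key is seen.
  std::size_t get_or_insert(const std::string& key) {
    const std::size_t h = std::hash<std::string>{}(key);
    // First level: fixed-size array, so picking the slot needs no lock.
    Slot& slot = slots_[h & (kNumSlots - 1)];
    // Second level: only threads that hit the same slot contend here.
    std::lock_guard<std::mutex> guard(slot.lock);
    auto it = slot.map.find(key);
    if (it != slot.map.end()) return it->second;
    const std::size_t idx = next_index_.fetch_add(1);
    slot.map.emplace(key, idx);
    return idx;
  }

 private:
  struct Slot {
    std::mutex lock;
    std::unordered_map<std::string, std::size_t> map;
  };
  std::array<Slot, kNumSlots> slots_;
  std::atomic<std::size_t> next_index_{0};
};
```

With 256 slots, two threads inserting different keys block each other only when the low 8 bits of the keys' hashes coincide, whereas a single global lock would serialize every insert.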