|
Turi Create
4.0
|
Classes | |
| struct | turi::csv_line_tokenizer |
| class | turi::csv_writer |
| struct | turi::dataframe_t |
| class | turi::dataframe_row_iterator |
| struct | turi::csv_file_handling_options |
Functions | |
| void | turi::parallel_dataframe_iterate (const dataframe_t &df, std::function< void(dataframe_row_iterator &iter, size_t startrow, size_t endrow)> partialrowfn) |
| std::istream & | turi::eol_safe_getline (std::istream &is, std::string &t) |
| std::map< std::string, std::shared_ptr< sarray< flexible_type > > > | turi::parse_csvs_to_sframe (const std::string &url, csv_line_tokenizer &tokenizer, csv_file_handling_options options, sframe &frame, std::string frame_sidx_file="") |
| std::istream& turi::eol_safe_getline | ( | std::istream & | is, |
| std::string & | t | ||
| ) |
std::getline replacement that correctly handles all \r, \n and \r\n line break characters.
| void turi::parallel_dataframe_iterate | ( | const dataframe_t & | df, |
| std::function< void(dataframe_row_iterator &iter, size_t startrow, size_t endrow)> | partialrowfn | ||
| ) |
Cuts up the provided begin iterator to a dataframe into rows, calling the lambda with a new iterator and the range of rows it is meant to process.
| std::map<std::string, std::shared_ptr<sarray<flexible_type> > > turi::parse_csvs_to_sframe | ( | const std::string & | url, |
| csv_line_tokenizer & | tokenizer, | ||
| csv_file_handling_options | options, | ||
| sframe & | frame, | ||
| std::string | frame_sidx_file = "" |
||
| ) |
Parses a CSV file / glob of CSV files to an SFrame.
| url | Path or Glob to read files |
| tokenizer | CSV tokenization options |
| options | Other file handling options |
| frame | Returned sframe object. This should be an uninitialized sframe. |
| frame_sidx_file | Location to save the result. Optional. Defaults to cache. |