Documentation ¶
Index ¶
Constants ¶
View Source
const String16RuneSize = 16
Variables ¶
This section is empty.
Functions ¶
func CSVtoNumpyMulti ¶
func CSVtoNumpyMulti(csvReader *csv.Reader, tbk io.TimeBucketKey, cvm *CSVMetadata, chunkSize int, isVariable bool, ) (npm *io.NumpyMultiDataset, endReached bool, err error)
Types ¶
type CSVConfig ¶
type CSVConfig struct { FirstRowHasColumnNames bool `yaml:"firstRowHasColumnNames"` TimeFormat string `yaml:"timeFormat"` Timezone string `yaml:"timeZone"` ColumnNameMap []string `yaml:"columnNameMap"` }
CSVConfig is constructed from the control file that specifies the formatting of the csv data.
type CSVMetadata ¶
type CSVMetadata struct { Config *CSVConfig // Configuration of the CSV file, including the names of the columns // DSV is data shapes inside this CSV file. The first 2 columns are "Epoch-date" and "Epoch-time". // If the schema of existent bucket is "Epoch,Ask,Bid", DSV is ["Epoch-date", "Epoch-time", "Epoch", "Ask", "Bid"]. DSV []io.DataShape // ColumnIndex maps the index of the columns in the CSV file to each time bucket in the DB. // ColumnIndex[i+2]=-1 when the column of DSV[i] doesn't exist in the provided CSV file. // e.g. when the bucket is "Epoch,Ask,Bid" and Column[3] = -1, it means the provided CSV doesn't have "Ask" column. ColumnIndex []int }
func ReadMetadata ¶
func ReadMetadata(dataFD, controlFD *os.File, dbDataShapes []io.DataShape) (csvReader *csv.Reader, cvm *CSVMetadata, err error)
ReadMetadata returns formatting info about the csv file containing the data to be loaded into the database.
Click to show internal directories.
Click to hide internal directories.