tableau

Version: v0.9.1
Note: this package is not in the latest version of its module.
Published: Feb 24, 2022 | License: MIT | Imports: 9 | Imported by: 3

README

Tableau

Modern Configuration Converter

A modern configuration converter based on Protobuf (proto3).

Features

  • Convert xlsx to JSON; JSON is the first-class export target.
  • Use protobuf as the IDL (Interface Description Language) to define the structure of xlsx.
  • The conversion engine is written in Go.
  • Support multiple programming languages, thanks to protobuf.

Concepts

  • Importer: xlsx/xml importer.
  • IR: Intermediate Representation.
  • Filter: filter the IR.
  • Exporter: JSON(protojson), Text(prototext), Wire(protowire).
  • Protoconf: a configuration metadata format based on protobuf.

Workflow

xlsx -> Importer -> Protoconf -> Exporter -> JSON/Text/Wire

Types

  • Scalar
  • Message(struct)
  • List
  • Map(unordered)
  • Timestamp
  • Duration

TODO

protoc plugins
  • Golang
  • C++
  • C#/.NET
  • Python
  • Lua
  • JavaScript/TypeScript/Node
  • Java
Metadata
  • metatable: a message to describe the worksheet's metadata.
  • metafield: a message to describe the caption's metadata.
  • captrow: caption row, the exact row number of captions in the worksheet. Newlines in a caption are allowed for readability and are trimmed in conversion.
  • descrow: description row, the exact row number of descriptions in the worksheet.
  • datarow: data row, the start row number of data.

Newline (line break) in major operating systems:

OS                   Abbreviation   Escape sequence
Unix (Linux, OS X)   LF             \n
Microsoft Windows    CRLF           \r\n
Classic Mac OS       CR             \r

LF: Line Feed, CR: Carriage Return.
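
The captrow note says newlines in a caption are trimmed in conversion. A minimal sketch of how that could look in Go, covering all three newline forms from the table. `trimCaptionNewlines` is a hypothetical helper written for illustration, not a function from this package, and the real implementation may differ.

```go
package main

import (
	"fmt"
	"strings"
)

// trimCaptionNewlines removes CRLF, LF, and CR line breaks from a caption.
// Hypothetical helper for illustration only.
func trimCaptionNewlines(caption string) string {
	// "\r\n" is listed first so a Windows line break is matched as one unit.
	r := strings.NewReplacer("\r\n", "", "\n", "", "\r", "")
	return r.Replace(caption)
}

func main() {
	fmt.Println(trimCaptionNewlines("Item\nID"))     // ItemID
	fmt.Println(trimCaptionNewlines("Item\r\nName")) // ItemName
}
```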


Generator
  • generate protoconf by xlsx(header): xlsx -> protoconf
  • generate xlsx(header) by protoconf: protoconf -> xlsx
Conversion
  • xlsx -> JSON(default format and human readable)
  • xlsx -> Wire(small size)
  • xlsx -> Text(human debugging)
  • JSON -> xlsx
  • Wire -> xlsx
  • Text -> xlsx
Pretty Print
  • Multiline: every textual element on a new line
  • Indent: 4 space characters
  • JSON format support
  • Text format support
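
As a rough illustration of the Multiline and Indent items above, the sketch below pretty-prints a value as JSON with a 4-space indent. It uses the standard library's encoding/json as a stand-in; per the Concepts section, tableau's JSON exporter is based on protojson, so this only shows the output shape. The `item` type and `prettyJSON` helper are hypothetical.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// item is a hypothetical configuration record used only for this example.
type item struct {
	ID   int    `json:"id"`
	Name string `json:"name"`
}

// prettyJSON renders v as multiline JSON indented with 4 spaces.
func prettyJSON(v interface{}) (string, error) {
	b, err := json.MarshalIndent(v, "", "    ")
	if err != nil {
		return "", err
	}
	return string(b), nil
}

func main() {
	out, err := prettyJSON(item{ID: 1, Name: "apple"})
	if err != nil {
		panic(err)
	}
	fmt.Println(out)
}
```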
EmitUnpopulated
  • JSON: EmitUnpopulated specifies whether to emit unpopulated fields.
Scalar Types
  • integer: int32, uint32, int64 and uint64
  • float: float and double
  • bool
  • string
  • bytes
  • datetime, date, time, duration
Enumerations
  • enum: The Parser accepts three enum value forms:
    • enum value number
    • enum value name
    • enum value alias name (with EnumValueOptions specified)
  • enum: validate the enum value.
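
The three accepted enum forms (number, name, alias) plus validation could be handled roughly as below. This is a sketch under stated assumptions: the fruit enum, its alias table, and `parseFruit` are all hypothetical, and the lookup order is a guess rather than the parser's documented behavior.

```go
package main

import (
	"fmt"
	"strconv"
)

// Hypothetical enum tables for illustration only.
var (
	fruitByName  = map[string]int32{"FRUIT_UNKNOWN": 0, "FRUIT_APPLE": 1, "FRUIT_ORANGE": 2}
	fruitByAlias = map[string]int32{"Apple": 1, "Orange": 2}
	fruitNumbers = map[int32]bool{0: true, 1: true, 2: true}
)

// parseFruit accepts an enum value number, name, or alias and
// returns an error for anything that is not a defined value.
func parseFruit(cell string) (int32, error) {
	// Form 1: enum value number.
	if n, err := strconv.ParseInt(cell, 10, 32); err == nil {
		if !fruitNumbers[int32(n)] {
			return 0, fmt.Errorf("enum number %d is not defined", n)
		}
		return int32(n), nil
	}
	// Form 2: enum value name.
	if n, ok := fruitByName[cell]; ok {
		return n, nil
	}
	// Form 3: enum value alias name.
	if n, ok := fruitByAlias[cell]; ok {
		return n, nil
	}
	return 0, fmt.Errorf("unknown enum value %q", cell)
}

func main() {
	for _, cell := range []string{"1", "FRUIT_ORANGE", "Apple", "9", "Pear"} {
		n, err := parseFruit(cell)
		fmt.Println(cell, n, err)
	}
}
```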
Composite Types
  • message: horizontal(row direction) layout, fields located in cells.
  • message: simple in-cell message, where each field must be a scalar type. It is a comma-separated list of field values, e.g.: 1,test,3.0. The number of values need not equal the number of fields, as fields are filled in order. Fields left unconfigured are filled with the default value of their scalar type.
  • list: horizontal(row direction) layout, which is list's default layout, and each item can be message or scalar.
  • list: vertical(column direction) layout, and each item should be a message.
  • list: simple in-cell list, element must be scalar type. It is a comma-separated list of elements. E.g.: 1,2,3.
  • list: keyed list, auto aggregate the struct with the same key field.
  • list: scalable or dynamic list size.
  • list: smart recognition of empty element at any position.
  • map: horizontal(row direction) layout.
  • map: vertical(column direction) layout, and is map's default layout.
  • map: unordered map or hash map.
  • map: ordered map.
  • map: simple in-cell map, both key and value must be scalar type. It is a comma-separated list of key:value pairs. E.g.: 1:10,2:20,3:30.
  • map: scalable or dynamic map size.
  • map: smart recognition of empty value at any position.
  • nesting: unlimited nesting of message, list, and map.
  • nesting: the composite type's first element can be composite type.
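
To make the in-cell formats above concrete, here is a minimal sketch that parses the in-cell list form (1,2,3) and the in-cell map form (1:10,2:20,3:30). It assumes int32 elements, keys, and values for simplicity; `parseIncellList` and `parseIncellMap` are hypothetical names, not part of this package's API.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// parseIncellList parses a comma-separated cell such as "1,2,3".
// An empty cell yields an empty (nil) list.
func parseIncellList(cell string) ([]int32, error) {
	if strings.TrimSpace(cell) == "" {
		return nil, nil
	}
	var out []int32
	for _, s := range strings.Split(cell, ",") {
		n, err := strconv.ParseInt(strings.TrimSpace(s), 10, 32)
		if err != nil {
			return nil, fmt.Errorf("invalid list element %q: %v", s, err)
		}
		out = append(out, int32(n))
	}
	return out, nil
}

// parseIncellMap parses comma-separated key:value pairs such as "1:10,2:20".
func parseIncellMap(cell string) (map[int32]int32, error) {
	out := map[int32]int32{}
	if strings.TrimSpace(cell) == "" {
		return out, nil
	}
	for _, pair := range strings.Split(cell, ",") {
		kv := strings.SplitN(pair, ":", 2)
		if len(kv) != 2 {
			return nil, fmt.Errorf("invalid pair %q, want key:value", pair)
		}
		k, err := strconv.ParseInt(strings.TrimSpace(kv[0]), 10, 32)
		if err != nil {
			return nil, fmt.Errorf("invalid key in %q: %v", pair, err)
		}
		v, err := strconv.ParseInt(strings.TrimSpace(kv[1]), 10, 32)
		if err != nil {
			return nil, fmt.Errorf("invalid value in %q: %v", pair, err)
		}
		out[int32(k)] = int32(v)
	}
	return out, nil
}

func main() {
	list, _ := parseIncellList("1,2,3")
	fmt.Println(list) // [1 2 3]
	m, _ := parseIncellMap("1:10,2:20,3:30")
	fmt.Println(m) // map[1:10 2:20 3:30]
}
```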
Default Values

Each scalar type's default value is the same as in protobuf.

  • integer: 0
  • float: 0.0
  • bool: false
  • string: ""
  • bytes: ""
  • in-cell message: each field's default value is the same as in protobuf
  • in-cell list: each element's default value is the same as in protobuf
  • in-cell map: the default values of both key and value are the same as in protobuf
  • message: all fields have default values
Empty
  • scalar: default value same as protobuf.
  • message: empty message will not be spawned if all fields are empty.
  • list: empty list will not be spawned if list's size is 0.
  • list: empty message will not be appended if list's element(message type) is empty.
  • map: empty map will not be spawned if map's size is 0.
  • map: empty message will not be inserted if map's value(message type) is empty.
  • nesting: recursively empty.
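
The "recursively empty" rule can be pictured with a small tree model: a scalar cell is a string, a message is a map, and a list is a slice. The sketch below is only an illustration under that assumed model; `isEmpty` and `appendNonEmpty` are hypothetical helpers and do not reflect how the converter represents data internally.

```go
package main

import "fmt"

// isEmpty reports whether v is recursively empty under the assumed model:
// string = scalar cell, map = message, slice = list.
func isEmpty(v interface{}) bool {
	switch x := v.(type) {
	case nil:
		return true
	case string:
		return x == ""
	case []interface{}:
		for _, e := range x {
			if !isEmpty(e) {
				return false
			}
		}
		return true
	case map[string]interface{}:
		for _, e := range x {
			if !isEmpty(e) {
				return false
			}
		}
		return true
	default:
		return false
	}
}

// appendNonEmpty appends elem only when it is not recursively empty,
// mirroring "empty message will not be appended".
func appendNonEmpty(list []interface{}, elem interface{}) []interface{} {
	if isEmpty(elem) {
		return list
	}
	return append(list, elem)
}

func main() {
	empty := map[string]interface{}{"id": "", "tags": []interface{}{}}
	full := map[string]interface{}{"id": "1", "tags": []interface{}{}}
	var list []interface{}
	list = appendNonEmpty(list, empty)
	list = appendNonEmpty(list, full)
	fmt.Println(len(list)) // 1
}
```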
Merge
  • merge multiple workbooks
  • merge multiple worksheets
Workbook meta

workbook meta sheet @TABLEAU:

  • specify which sheets to be parsed
  • specify parser options for each sheet
Sheet    Alias          Nameline   Typeline
Sheet1   ExchangeInfo   2          2
Datetime

See the article "Understanding about RFC 3339 for Datetime and Timezone Formatting in Software Engineering".

# This is acceptable in ISO 8601 and RFC 3339 (with T)
2019-10-12T07:20:50.52Z
# This is only accepted in RFC 3339 (without T)
2019-10-12 07:20:50.52Z
  • "Z" stands for the Zero (Zulu) timezone, UTC+0, and is equal to +00:00 in RFC 3339.
  • RFC 3339 is a profile of the ISO 8601 datetime format. One notable difference is that RFC 3339 allows "T" to be replaced with a space.

Use RFC 3339, which follows ISO 8601.

  • Timestamp: based on google.protobuf.Timestamp, see JSON mapping
  • Timezone: see ParseInLocation
  • DST: Daylight Saving Time. There is no plan to handle it.
  • Datetime: excel format: yyyy-MM-dd HH:mm:ss, e.g.: 2020-01-01 05:10:00
  • Date: excel format: yyyy-MM-dd or yyyyMMdd, e.g.: 2020-01-01 or 20200101
  • Time: excel format: HH:mm:ss or HHmmss, e.g.: 05:10:00 or 051000
  • Duration: based on google.protobuf.Duration, see JSON mapping
  • Duration: excel format: form "72h3m0.5s", see golang duration string form
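
The excel formats listed above map naturally onto Go's time package. The following sketch parses a datetime and a date with time.ParseInLocation and a duration with time.ParseDuration. The helper names and the fixed UTC+8 location are assumptions made for the example; the package's own parsing code may be organized differently.

```go
package main

import (
	"fmt"
	"strings"
	"time"
)

// utcPlus8 is an example fixed-offset location (an assumption for this sketch).
var utcPlus8 = time.FixedZone("UTC+8", 8*60*60)

// parseDatetime parses the "yyyy-MM-dd HH:mm:ss" form in the given location.
func parseDatetime(cell string, loc *time.Location) (time.Time, error) {
	return time.ParseInLocation("2006-01-02 15:04:05", cell, loc)
}

// parseDate parses either the dashed form (2020-01-01) or the compact form (20200101).
func parseDate(cell string, loc *time.Location) (time.Time, error) {
	layout := "20060102"
	if strings.Contains(cell, "-") {
		layout = "2006-01-02"
	}
	return time.ParseInLocation(layout, cell, loc)
}

func main() {
	ts, err := parseDatetime("2020-01-01 05:10:00", utcPlus8)
	if err != nil {
		panic(err)
	}
	fmt.Println(ts.UTC().Format(time.RFC3339)) // 2019-12-31T21:10:00Z

	d, err := parseDate("20200101", utcPlus8)
	if err != nil {
		panic(err)
	}
	fmt.Println(d.Format("2006-01-02")) // 2020-01-01

	dur, err := time.ParseDuration("72h3m0.5s")
	if err != nil {
		panic(err)
	}
	fmt.Println(dur) // 72h3m0.5s
}
```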
Transpose
  • Interchange the rows and columns of a worksheet.
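
Interchanging rows and columns can be sketched on a worksheet held as a slice of string rows. `transpose` below is a hypothetical helper; it pads ragged rows with empty cells, which is an assumption about how short rows should be treated.

```go
package main

import "fmt"

// transpose interchanges the rows and columns of a worksheet.
// Rows shorter than the widest row are padded with empty cells.
func transpose(rows [][]string) [][]string {
	width := 0
	for _, r := range rows {
		if len(r) > width {
			width = len(r)
		}
	}
	out := make([][]string, width)
	for c := range out {
		out[c] = make([]string, len(rows))
		for r := range rows {
			if c < len(rows[r]) {
				out[c][r] = rows[r][c]
			}
		}
	}
	return out
}

func main() {
	sheet := [][]string{
		{"ID", "Name"},
		{"1", "Apple"},
		{"2", "Orange"},
	}
	fmt.Println(transpose(sheet)) // [[ID 1 2] [Name Apple Orange]]
}
```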
Validation
  • Min
  • Max
  • Range
  • Options: e.g.: enum type
  • Foreign key
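
Since validation is still on the TODO list, the following is only a sketch of what Range and Options checks might look like. `checkRange` and `checkOption` are hypothetical helpers, not planned API.

```go
package main

import "fmt"

// checkRange reports an error if v falls outside the inclusive range [lo, hi].
func checkRange(v, lo, hi int64) error {
	if v < lo || v > hi {
		return fmt.Errorf("value %d out of range [%d, %d]", v, lo, hi)
	}
	return nil
}

// checkOption reports an error if v is not one of the allowed options.
func checkOption(v string, allowed []string) error {
	for _, a := range allowed {
		if v == a {
			return nil
		}
	}
	return fmt.Errorf("value %q is not one of %v", v, allowed)
}

func main() {
	fmt.Println(checkRange(5, 1, 10))
	fmt.Println(checkRange(11, 1, 10))
	fmt.Println(checkOption("Pear", []string{"Apple", "Orange"}))
}
```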
Error Message
  • Report clear and precise error messages when conversion fails, modeled on the diagnostics of programming language compilers
  • Use Go templates to define error message templates
  • Multiple languages support, focused on English and Simplified Chinese
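
One way to define error messages with Go templates, as the list above proposes, is to render a struct of error details through text/template. The `cellError` fields and the message wording below are assumptions for illustration; the project's eventual templates may look quite different.

```go
package main

import (
	"fmt"
	"strings"
	"text/template"
)

// cellError carries hypothetical details about a failed cell conversion.
type cellError struct {
	Workbook  string
	Worksheet string
	Cell      string
	Value     string
	Reason    string
}

// errTmpl is an example message template, loosely styled after compiler diagnostics.
var errTmpl = template.Must(template.New("err").Parse(
	`{{.Workbook}}#{{.Worksheet}}!{{.Cell}}: invalid value {{printf "%q" .Value}}: {{.Reason}}`))

// renderError formats e using errTmpl.
func renderError(e cellError) (string, error) {
	var sb strings.Builder
	if err := errTmpl.Execute(&sb, e); err != nil {
		return "", err
	}
	return sb.String(), nil
}

func main() {
	msg, err := renderError(cellError{
		Workbook:  "Item.xlsx",
		Worksheet: "Sheet1",
		Cell:      "C5",
		Value:     "abc",
		Reason:    "want int32",
	})
	if err != nil {
		panic(err)
	}
	fmt.Println(msg) // Item.xlsx#Sheet1!C5: invalid value "abc": want int32
}
```

A second template per language (for example Simplified Chinese) could be selected at runtime in the same way.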
Performance
  • Stress test
  • Each goroutine processes one worksheet
  • Multiple process model
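
The "one goroutine per worksheet" idea can be sketched with sync.WaitGroup. Here `convertSheet` is a hypothetical stand-in for real conversion work, and each goroutine writes only to its own slot in the results slice, so no extra locking is needed.

```go
package main

import (
	"fmt"
	"strings"
	"sync"
)

// convertSheet is a hypothetical stand-in for converting one worksheet.
func convertSheet(name string) string {
	return strings.ToUpper(name)
}

// convertAll converts every worksheet in its own goroutine and
// returns the results in the same order as the input.
func convertAll(sheets []string) []string {
	results := make([]string, len(sheets))
	var wg sync.WaitGroup
	for i, name := range sheets {
		wg.Add(1)
		go func(i int, name string) {
			defer wg.Done()
			results[i] = convertSheet(name) // each goroutine owns index i
		}(i, name)
	}
	wg.Wait()
	return results
}

func main() {
	fmt.Println(convertAll([]string{"item", "hero", "shop"})) // [ITEM HERO SHOP]
}
```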

Contribution

Requirements
Protobuf

Go to Protocol Buffers v3.17.3, download the protoc build for your platform, then install it following its README.

protoc-gen-go

Install: go install google.golang.org/protobuf/cmd/protoc-gen-go@v1.27.1

Documentation

Index

Constants

This section is empty.

Variables

This section is empty.

Functions

func Excel2Conf

func Excel2Conf(protoPackage, indir, outdir string, setters ...options.Option)

Excel2Conf converts excel files (with tableau header) to different formatted configuration files. Supported formats: JSON, Text, and Wire.

func Excel2Proto

func Excel2Proto(protoPackage, goPackage, indir, outdir string, setters ...options.Option)

Excel2Proto converts excel files (with tableau header) to protoconf files.

func ParseMeta

func ParseMeta(indir, relWorkbookPath string) importer.Importer

ParseMeta parses the @TABLEAU sheet in a workbook.

func Proto2Excel

func Proto2Excel(protoPackage, indir, outdir string)

Proto2Excel converts protoconf files to excel files (with tableau header).

func XML2Conf

func XML2Conf(protoPackage, indir, outdir string, setters ...options.Option)

XML2Conf converts xml files to different formatted configuration files. Supported formats: json, text, and wire.

func XML2Proto

func XML2Proto(protoPackage, goPackage, indir, outdir string, setters ...options.Option)

XML2Proto converts xml files to protoconf files.

Types

This section is empty.

Directories

Path Synopsis
cmd
internal
camelcase
Package camelcase is a micro package to split the words of a camelcase type string into a slice of words.
confgen/mexporter
mexporter is the message exporter package, which can export one single message to different formats: JSON, Text, and Wire.
fs
proto
