Vienna.rb Talk Notes: Mining for Gold Using the World's #1 and Most Popular Data Format (w/ Ruby and CsvReader)

Gerald_Bauer1 · 19 October 2018 13:24

Hello,

the talk notes from yesterday's Vienna.rb meetup titled Mining for
Gold Using the World's #1 and Most Popular Data Format (w/ Ruby and
CsvReader) [1]. The contents reads:

- Q: The World's Most Popular Data Format?
  - Data Format Timeline / History - Past, Present, Future
  - Data Formats @ Statistics Austria
    - Triva Quiz: Who Invented the Space Character in Writing (and When)?
    - Triva Quiz: Who Invented the Space Character in CSV (and When)?
  - Data Formats @ Open Gov Data Austria
  - Data Formats @ Open Gov Data United States of America (U.S.A.)
  - Data Formats @ DataHub v1
    - Data Formats @ DataHub v2 - (Tabular) Data Packages
  - And the Winner is ... Lies, Damned Lies and Statistics
- What's Comma-Separated Values (CSV) - One Format? Many Formats?
   - CSV Basics - What about commas in values?
   - CSV Basics - What about quotes in quotes?
   - CSV Basics - Many Formats / Dialects / Variants?
   - CSV Basics - Edge Cases
   - CSV Basics - Type Inference and Data Converters
- CSV Formats / Variants
  - CSV "The Right Way"
  - CSV "Strict"
  - CSV <3 Numeric
  - CSV <3 JSON
  - CSV <3 YAML
  - Database Export
    - PostgreSQL CSV
    - PostgreSQL Text
    - MySQL
- CsvReader Library Usage
  - What about type inference and data converters?
  - What about Enumerable?
  - What about headers?
  - What about symbol keys for hashes?
  - What about (typed) structs?
  - What about tabular data packages with pre-defined types / schemas?
- Triva Quiz - Mining for Gold - What's the country with the biggest
gold mining / production per year today?

Happy data wrangling / mining with Ruby. Cheers. Prost.

PS: The talk slides from the 2nd talk titled Designing HexaPDF:
Iterarative Design, Orthogonality and Other Design Tools [2] by Thomas
Leitner

[1] https://github.com/geraldb/talks/blob/master/csv.md
[2] https://talks.gettalong.org/2018-10-viennarb

Topic		Replies	Views
csvreader v1.0 - read tabular comma-separated values (csv) the right way (incl. hash readers, converters, enumerables, dialects, and more) ruby-talk	0	352	9 October 2018
Csv ruby-talk	0	109	24 February 2014
Why the CSV standard library is broken, broken, broken (and how to fix it) ruby-talk	13	2243	24 August 2018
Re: Re: Mastering CSV in Ruby book(let) pdf download (37-pages, 185kb) is now free (Donation Optional). Happy Monday! -- Paweł Dąbrowski (Ruby is dead, long live Rails!) ruby-talk	1	248	24 August 2022
Re: Mastering CSV in Ruby book(let) pdf download (37-pages, 185kb) is now free (Donation Optional). Happy Monday! -- Paweł Dąbrowski (Ruby is dead, long live Rails!) ruby-talk	0	194	23 August 2022

Vienna.rb Talk Notes: Mining for Gold Using the World's #1 and Most Popular Data Format (w/ Ruby and CsvReader)

Related topics