@jamesog thanks for sharing this! I’m going to have to play with it. I have some complex #JSON cases that might really benefit. I also see promise in cases where today I might translate #XLSX to #CSV and then import into #SQLite. Why do all that if I can query the original directly? Awesome!
It's batshit that #Excel doesn't quote CSV exports by default (yes, I know the standard says "may" and not "must" and, yes, you can make custom export templates but what a pain in the ass).
(This is all you need to know about how my day is going)
But it's fine because I have an old #python scrip I can use to convert excel to proper #csv.
Just need to rewrite as python3, oh, and install pandas, and openpyxl and... hey a dependency issue but that's fine.
It's all fine.
Since #Mint is shutting down, it's important to know that Mint exports to CSV. The #FireflyIII Data Importer can handle #CSV files just fine. If some people could send me a few lines of Mint export data, I can make sure there's a tailor-made import configuration ready to go!
#CsvDiff has finally reached v0.1.0, it's first ever non-alpha/-beta release! 🎉
New features like getting at the headers from the diffresult have been needed for the following PR in qsv (which is in final review): https://github.com/jqnatividad/qsv/pull/1395
When merged, you'll be able to decide, whether the diffresult should output headers or not (see examples in the PR). :awesome:
Collections methods are added to the Reader class to improve DX as well as the ability to format the records prior to accessing them using the same technique already present on the Writer class.
#HowToThing#008 — CSV parsing & filtering into structured data via https://thi.ng/csv and creating a multi-plot data visualization via https://thi.ng/viz (along with a range of other helpful packages for various side aspects).
The attached visualization shows a lin-log plot of new COVID cases between March 2020 - Dec 2021:
Daily world total as line plot
UK (red) and USA (blue) cases as interleaved bar plots
"Crosswalker is a general purpose tool for joining columns of text data that don't perfectly match."
"The tool auto-ranks matches for each data row ...
The tool auto-matches values that are practically identical
The results are presented in an interactive spreadsheet from which you can manually continue matching
As you go, the columns are resorted to highlight the most probable remaining matches"