Rev. Time Author
a63edce1c4f2 2022-10-24 18:34:15 Lorenzo Isella

I also save the Spanish data as a parquet file.

6380bf81ffc8 2022-10-24 18:33:16 Lorenzo Isella

I also save the data as a parquet file.

23c6993498b3 2022-10-24 18:32:12 Lorenzo Isella

I convert the year to double (easier for merging with other datasets).

bf3740694c5e 2022-10-24 02:42:30 Lorenzo Isella

I now have all the columns as in the old tam file.

5c3b73e73ddb 2022-10-23 17:01:34 Lorenzo Isella

I fixed a bug (repeated lines in covid) and I now also save the output as a parquet file.

c6eb618c6013 2022-10-23 16:46:19 Lorenzo Isella

I now open the tsv file without reading it. It is no longer loaded into memory.

1bb702817bd1 2022-10-23 16:39:02 Lorenzo Isella

A script showing how to convert a tsv to parquet using a schema and without loading the data into memory.

b37f14dee694 2022-10-21 20:17:15 Lorenzo Isella

I added a file which re-creates the tam data starting from a single extraction.

5b5e973bef79 2022-10-21 16:06:51 Lorenzo Isella

A file to work with parquet files without loading them into memory.

fa3f698240c9 2022-10-20 20:13:51 Lorenzo Isella

A script showing how to work with a large dataset which is converted from tsv to parquet. It is then loaded as an arrow table, which reduces memory consumption.

759e5fc884a6 2022-10-18 23:41:53 Lorenzo Isella

Code to analyse in detail the durations of the covid cases in the tracker.

aaf2c62db47d 2022-10-18 23:41:12 Lorenzo Isella

I save some other duration stats.

ba0b282dc9af 2022-10-18 22:30:17 Lorenzo Isella

I now also save some duration stats as an RDS file.

d8413ff11976 2022-10-17 15:58:59 Lorenzo Isella

I reinstalled bash_it from scratch and I changed the location of the folder with the scripts.

6905d385aa23 2022-10-13 18:09:52 Lorenzo Isella

I also save the output as an RDS file.

ff3870ba6d1d 2022-10-13 17:44:43 Lorenzo Isella

I cleaned up the code a bit.

1abe54dd4fce 2022-10-13 01:49:30 Lorenzo Isella

I also added some extra variables here.

94d7e8de6ea6 2022-10-13 01:42:08 Lorenzo Isella

I added some extra variables.

9c8d36b9f284 2022-10-12 19:42:34 Lorenzo Isella

I now read two different input files.

17fa0be4a749 2022-10-12 19:40:47 Lorenzo Isella

Some modifications to produce a simpler output.

39cc02f88d4a 2022-10-11 22:48:58 Lorenzo Isella

I added and/or modified the scripts to deal with Polish, Spanish and Romanian transparency data.

cf3decb3c491 2022-10-05 16:20:37 Lorenzo Isella

A script to find info about aid to semiconductors and rare earth elements relying on a number of text tools.

39e6f9ac66ed 2022-10-04 22:56:16 Lorenzo Isella

I added a function to look for multiple keywords in a column.

fb00af179b09 2022-09-30 16:15:21 Lorenzo Isella

I added a parallel version of the function to look for a list of keywords.

569fd3539a77 2022-09-29 23:19:28 Lorenzo Isella

I added a script that looks (in parallel) for multiple keywords and shows the progress of the search.

27289eb79302 2022-09-29 22:12:35 Lorenzo Isella

I added a new function to look for many keywords in one go.

f966b354558f 2022-09-29 18:12:16 Lorenzo Isella

I added a script which generates the correct zip files for the submissions to Eurostat.

7887a1c7b24f 2022-09-29 01:56:05 Lorenzo Isella

Now mutt also asks me about CC when composing an email.

0eeb2777a249 2022-09-28 16:00:28 Lorenzo Isella

A simple script to generate and save a JSON file.

ea6948d69378 2022-09-28 15:56:57 Lorenzo Isella

A simple script to generate a JSON file.