• R/O
  • SSH

提交

标签
No Tags

Frequently used words (click to add to your profile)

javac++androidlinuxc#windowsobjective-ccocoa誰得qtpythonphprubygameguibathyscaphec計画中(planning stage)翻訳omegatframeworktwitterdomtestvb.netdirectxゲームエンジンbtronarduinopreviewer

Commit MetaInfo

修订版5c3b73e73ddbbc798b6b3e0db3a0b9cc0a0514de (tree)
时间2022-10-23 17:01:34
作者Lorenzo Isella <lorenzo.isella@gmai...>
CommiterLorenzo Isella

Log Message

I fixed a bug (repeated lines in covid) and I now save the output also as a parquet file.

更改概述

差异

diff -r c6eb618c6013 -r 5c3b73e73ddb R-codes/create_tam_parquet.R
--- a/R-codes/create_tam_parquet.R Sun Oct 23 09:46:19 2022 +0200
+++ b/R-codes/create_tam_parquet.R Sun Oct 23 10:01:34 2022 +0200
@@ -61,7 +61,8 @@
6161 covid <- open_dataset("SA-Covid19.csv", format="csv") |>
6262 rename("case_reference"="Case Reference") |>
6363 select(case_reference) |>
64- mutate(is_covid_case="Yes")
64+ mutate(is_covid_case="Yes") |>
65+ distinct()
6566
6667
6768
@@ -141,4 +142,13 @@
141142 max_rows_per_file = 1e7
142143 )
143144
145+
146+write_dataset(
147+ df_new,
148+ format = "parquet",
149+ path = "./data_output/",
150+ max_rows_per_file = 1e7
151+)
152+
153+
144154 print("So far so good")