Commit MetaInfo

Revision: fc61ccc61df4069e9df1def452aa45aef27b70ec
Time: 2024-07-19 22:58:04
Author: Lorenzo Isella <lorenzo.isella@gmai...>
Committer: Lorenzo Isella

Log Message

I fixed a bug (I used to save twice the jobs in English!).

Diff

diff -r 560c914baca7 -r fc61ccc61df4 R-codes/clean_text.R
--- a/R-codes/clean_text.R Wed Jul 17 22:25:00 2024 +0200
+++ b/R-codes/clean_text.R Fri Jul 19 15:58:04 2024 +0200
@@ -5,12 +5,12 @@
 source("/home/lorenzo/myprojects-hg/R-codes/stat_lib.R")
 
 
-from_scratch <- 0
+from_scratch <- 1
 
-df_jobs <- read_csv("wi_dataset.csv")
+df_jobs <- read_csv("../input/wi_dataset.csv")
 
 
-labels <- read_csv("wi_labels.csv")
+labels <- read_csv("../input/wi_labels.csv")
 
 jobs <- df_jobs |>
     pull(description)
@@ -37,23 +37,23 @@
 }
 
 
-saveRDS(job_language, "job_language_list.RDS")
+saveRDS(job_language, "../output/job_language_list.RDS")
 } else {
 
-    job_language <- readRDS("job_language_list.RDS")
+    job_language <- readRDS("../output/job_language_list.RDS")
 }
 
 df_jobs_en <- df_jobs |>
     filter(job_language=="en")
 
-saveRDS(df_jobs_en, "jobs_in_english.RDS")
+saveRDS(df_jobs_en, "../output/jobs_in_english.RDS")
 
 
 df_jobs_non_en <- df_jobs |>
-    filter(job_language=="en")
+    filter(job_language!="en")
 
 
-saveRDS(df_jobs_non_en, "../output/jobs_not_in_english.RDS")
+saveRDS(df_jobs_non_en, "../output/jobs_not_in_english.RDS")
 
 
 
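
The fix in the last hunk changes the second filter from `==` to `!=`, so the non-English file no longer duplicates the English one. One thing that may be worth checking is how missing language codes behave: `dplyr::filter()` keeps only rows where the condition is `TRUE`, so a row whose language is `NA` ends up in neither subset. Below is a minimal sketch on hypothetical toy data. It assumes `job_language` is a column of the data frame, which may not match the actual script (there it also exists as a separate object), and the name `df_jobs_non_en_with_na` is purely illustrative.

```r
library(dplyr)

# Hypothetical toy data; the real df_jobs is read from wi_dataset.csv.
# Assumption: job_language is stored as a column of the data frame.
df_jobs <- tibble(
  description  = c("data analyst", "analista de datos", "unreadable text"),
  job_language = c("en", "es", NA)
)

# English jobs, as in the commit.
df_jobs_en <- df_jobs |>
  filter(job_language == "en")

# Non-English jobs, as in the fixed line.
# The NA row is dropped because NA != "en" evaluates to NA, not TRUE.
df_jobs_non_en <- df_jobs |>
  filter(job_language != "en")

# Illustrative variant that also keeps rows with an undetected language.
df_jobs_non_en_with_na <- df_jobs |>
  filter(is.na(job_language) | job_language != "en")

# The two subsets cover every row of df_jobs only when no language is NA.
covers_all_rows <- nrow(df_jobs_en) + nrow(df_jobs_non_en) == nrow(df_jobs)
```

Whether rows with an undetected language belong in the non-English file is a decision for the analysis; the sketch only shows that the two filters in the commit do not guarantee a full partition by themselves.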