how do I write French accented characters to csv for excel? - r

I have a vector of raw text that includes English and French words, so there are French words that have accented characters like this:
1 entretien ménager
2 concepteur réseaux
3 service à la clientèle
4 sécurité
5 infirmière auxiliaire
6 opérateur de machinerie en usine
7 consultant stratégique
8 ménage
9 ingénieur civil, gérant projet
10 éducatrice
The command Encoding(variable) tells me that there's a mix of 'unknown' and UTF-8 encodings. All of the ones above are coded as UTF-8.
This code replicates the problem on my mac:
library(foreign)
vec<-c('sécurité', 'service à la clientèle', 'assembleur', 'labour')
write.csv(data.frame(vec), file='~/Desktop/test.csv')
I have tried the same with write_excel_csv() and I get the same results.
I can only assume this is some kind of problem with the utf-8 encoding, but I can' t see my way to figure this out.
Thank you.
Results of sessionInfo()
R version 3.4.1 (2017-06-30)
Platform: x86_64-apple-darwin15.6.0 (64-bit)
Running under: macOS High Sierra 10.13.4
Matrix products: default
BLAS: /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
LAPACK: /Library/Frameworks/R.framework/Versions/3.4/Resources/lib/libRlapack.dylib
locale:
[1] en_CA.UTF-8/en_CA.UTF-8/en_CA.UTF-8/C/en_CA.UTF-8/en_CA.UTF-8
attached base packages:
[1] grid stats graphics grDevices utils datasets methods base
other attached packages:
[1] readr_1.1.1 bindrcpp_0.2.2 labelled_1.0.0 haven_1.1.1.9000
[5] survey_3.32-1 survival_2.41-3 Matrix_1.2-10 car_2.1-5
[9] stargazer_5.2 foreign_0.8-69 tidyr_0.8.0 dplyr_0.7.4
[13] ggplot2_2.2.1
loaded via a namespace (and not attached):
[1] Rcpp_0.12.16 pillar_1.2.1 compiler_3.4.1 nloptr_1.0.4
[5] plyr_1.8.4 bindr_0.1.1 forcats_0.3.0 tools_3.4.1
[9] lme4_1.1-13 tibble_1.4.2 gtable_0.2.0 nlme_3.1-131
[13] lattice_0.20-35 mgcv_1.8-17 pkgconfig_2.0.1 rlang_0.2.0
[17] cli_1.0.0 rstudioapi_0.7 yaml_2.1.18 parallel_3.4.1
[21] SparseM_1.77 hms_0.4.1 MatrixModels_0.4-1 nnet_7.3-12
[25] glue_1.2.0 R6_2.2.2 minqa_1.2.4 purrr_0.2.4
[29] magrittr_1.5 scales_0.5.0 MASS_7.3-47 splines_3.4.1
[33] assertthat_0.2.0 pbkrtest_0.4-7 colorspace_1.3-2 quantreg_5.33
[37] utf8_1.1.3 lazyeval_0.2.0 munsell_0.4.3 crayon_1.3.4 `
I should add, I have looked at some of the issues on GitHub and SO such as this, this, this, but have not found my answer.

Related

"object not found": R seems to not see my variables

When I do anything with my variables, for example: Y<-cbind(bank)or liq <-Liq, the error object not found appears. I guess, something is wrong with reading the excel file. I use library("readxl") and h<-read_xlsx("banks_NA.xlsx")to download the dataset.
Here is info about the session and data:
`>ls(h)
[1] "Da" "Dh_a" "Dh_l1" "Dh_m1" "Dl1" "Dm1"
[7] "K" "La_a" "La_l1" "La_m1" "Lh_a" "Lh_l1"
[13] "Lh_m1" "Liq" "Ll1" "Lm1" "Mac" "Na_Da"
[19] "OtR" "OtR_a" "PrT" "Pro" "S" "S_Bl"
[25] "S_Bo" "S_S" "bank" "date" "dis" "foreign"
[31] "live" "rD" "rL" "rQi" "rS" "rSc"
[37] "rSi" "region" "state" "ta"`
`> sessionInfo()
R version 3.5.0 (2018-04-23)
Platform: x86_64-apple-darwin15.6.0 (64-bit)
Running under: macOS Sierra 10.12.6
Matrix products: default
BLAS: /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
LAPACK: /Library/Frameworks/R.framework/Versions/3.5/Resources/lib/libRlapack.dylib
locale:
[1] C
attached base packages:
[1] stats graphics grDevices utils datasets methods
[7] base
other attached packages:
[1] sandwich_2.4-0 readxl_1.1.0 plm_1.6-6
[4] openxlsx_4.0.17 gplots_3.0.1 Formula_1.2-2
[7] foreign_0.8-70 class_7.3-14 caTools_1.17.1
[10] car_3.0-0 carData_3.0-1
loaded via a namespace (and not attached):
[1] Rcpp_0.12.16 cellranger_1.1.0 pillar_1.2.2
[4] compiler_3.5.0 forcats_0.3.0 bitops_1.0-6
[7] miscTools_0.6-22 tools_3.5.0 tibble_1.4.2
[10] nlme_3.1-137 lattice_0.20-35 rlang_0.2.0
[13] cli_1.0.0 maxLik_1.3-4 curl_3.2
[16] haven_1.1.1 rio_0.5.10 gtools_3.5.0
[19] lmtest_0.9-36 grid_3.5.0 data.table_1.10.4-3
[22] bdsmatrix_1.3-3 gdata_2.18.0 magrittr_1.5
[25] MASS_7.3-50 assertthat_0.2.0 abind_1.4-5
[28] KernSmooth_2.23-15 utf8_1.1.3 crayon_1.3.4
[31] zoo_1.8-1`
Yesterday, I did exactly the same operations and everything worked perfectly. Today I decided to update R studio and now everything is corrupted...

rate_limit() returns wrong reset time

rate_limit() from rtweet returns wrong value in column reset. Instead of showing 15, it shows values around 14.59. It varies between 14.59 and 14.61. I think it is not a problem with rtweet package but probably with either my system or time I have set on my computer.
I can not figure out what might be the reason. The only thing I managed to find out is that when I use Sys.time on my computer and some other computer, on which rate_limit works fine, there is 25 seconds difference. 25 seconds is around 41% of a minute so it seems that it might be a problem, however I don't know how to solve it. Below SessionInfo:
R version 3.4.3 (2017-11-30)
Platform: x86_64-apple-darwin15.6.0 (64-bit)
Running under: macOS High Sierra 10.13.3
Matrix products: default
BLAS: /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
LAPACK: /Library/Frameworks/R.framework/Versions/3.4/Resources/lib/libRlapack.dylib
locale:
[1] pl_PL.UTF-8/pl_PL.UTF-8/pl_PL.UTF-8/C/pl_PL.UTF-8/pl_PL.UTF-8
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] bindrcpp_0.2 feather_0.3.1 magrittr_1.5 forcats_0.2.0 stringr_1.2.0 dplyr_0.7.4
[7] purrr_0.2.4 readr_1.1.1 tidyr_0.7.2 tibble_1.3.4 ggplot2_2.2.1 tidyverse_1.2.1
[13] rtweet_0.6.0
loaded via a namespace (and not attached):
[1] reshape2_1.4.3 haven_1.1.0 lattice_0.20-35 colorspace_1.3-2 htmltools_0.3.6 base64enc_0.1-3
[7] yaml_2.1.14 rlang_0.1.4 foreign_0.8-69 glue_1.1.1 bit64_0.9-7 modelr_0.1.1
[13] readxl_1.0.0 bindr_0.1 plyr_1.8.4 munsell_0.4.3 gtable_0.2.0 cellranger_1.1.0
[19] rvest_0.3.2 evaluate_0.10.1 psych_1.7.8 knitr_1.17 httpuv_1.3.5 parallel_3.4.3
[25] curl_2.8.1 broom_0.4.2 Rcpp_0.12.13 openssl_0.9.7 scales_0.5.0 backports_1.1.1
[31] jsonlite_1.5 bit_1.1-12 mnormt_1.5-5 hms_0.3 digest_0.6.12 stringi_1.1.5
[37] grid_3.4.3 rprojroot_1.2 cli_1.0.0 tools_3.4.3 lazyeval_0.2.0 crayon_1.3.4
[43] pkgconfig_2.0.1 xml2_1.1.1 lubridate_1.7.2 assertthat_0.2.0 rmarkdown_1.6 httr_1.3.1
[49] rstudioapi_0.7 R6_2.2.2 nlme_3.1-131 compiler_3.4.3
Somehow the issue solved itself. I changed timezone (traveled from US back to Europe) and now it works fine.

Why is my resource_map function in R asking for a quasiquoted argument?

I am trying to run (in R) this off-the-shelf example from the Resource Map function help page:
library(bupaR)
library(eventdataR)
data(patients)
resource_map(patients)
and somehow it is returning this error:
Error: UQ() can only be used within a quasiquoted argument
Can someone help me understand and propose a fix as well? Is it something with my R/Rstudio, because this example comes straight from the "resource_map" help page.
Here is the sessioninfo:
R version 3.4.3 (2017-11-30) Platform: x86_64-apple-darwin15.6.0
(64-bit) Running under: macOS High Sierra 10.13.3
Matrix products: default BLAS:
/System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
LAPACK:
/Library/Frameworks/R.framework/Versions/3.4/Resources/lib/libRlapack.dylib
locale: [1]
en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
attached base packages: [1] stats graphics grDevices utils
datasets methods base
other attached packages: [1] bindrcpp_0.2 xesreadR_0.2.2
processmonitR_0.1.0 processmapR_0.2.1 eventdataR_0.1.2
petrinetR_0.1.0 [7] edeaR_0.7.2 bupaR_0.3.2
loaded via a namespace (and not attached): [1] Rook_1.1-1
purrr_0.2.4 ggthemes_3.4.0 colorspace_1.3-2
viridisLite_0.3.0 miniUI_0.1.1 [7] htmltools_0.3.6
yaml_2.1.17 plotly_4.7.1 XML_3.98-1.10
rlang_0.2.0 pillar_1.2.1 [13] glue_1.2.0
RColorBrewer_1.1-2 bindr_0.1 plyr_1.8.4
stringr_1.3.0 munsell_0.4.3 [19] gtable_0.2.0
visNetwork_2.0.3 htmlwidgets_1.0 forcats_0.3.0
httpuv_1.3.6.2 DiagrammeR_0.9.2 [25] Rcpp_0.12.15
xtable_1.8-2 readr_1.1.1 scales_0.5.0
jsonlite_1.5 rgexf_0.15.3 [31] mime_0.5
gridExtra_2.3 brew_1.0-6 ggplot2_2.2.1 hms_0.4.1
digest_0.6.15 [37] stringi_1.1.6 dplyr_0.7.4
shiny_1.0.5 grid_3.4.3 influenceR_0.1.0
tools_3.4.3 [43] magrittr_1.5 lazyeval_0.2.1
tibble_1.4.2 tidyr_0.8.0 pkgconfig_2.0.1 xml2_1.2.0
[49] data.table_1.10.4-3 downloader_0.4 lubridate_1.7.3
httr_1.3.1 assertthat_0.2.0 rstudioapi_0.7 [55]
viridis_0.5.0 R6_2.2.2 igraph_1.1.2
shinyTime_0.2.1 compiler_3.4.3
Thanks!

koRpus package utilising TreeTagger within R (RStudio) on macOS

im exploring the koRpus package in R on macOS, in trying to use the treetag function on the following object:
text
[1] "Because I could not stop for Death -" "He kindly stopped for me -"
[3] "The Carriage held but just Ourselves -" "and Immortality"
using the following syntax
> tagged.text <- treetag(as.vector(paste(text, collapse = '')), format = "obj", debug = TRUE)
i get the following error
file: /var/folders/bt/sdf_vz6d3qbd188c7tkz50gw0000gn/T//RtmpoatWov/tempTextFromObject12d3d169614b6.txt
sys.tt.call: /Applications/treetagger/cmd/tree-tagger-english /var/folders/bt/sdf_vz6d3qbd188c7tkz50gw0000gn/T//RtmpoatWov/tempTextFromObject12d3d169614b6.txt
Error in matrix(unlist(strsplit(tagged.text, "\t")), ncol = 3, byrow = TRUE, :
'data' must be of a vector type, was 'NULL'
when i try the emboldened command above on its equivalent, i get this
matrix(unlist(strsplit(paste(text, collapse = ''), "\t")))
[,1] [1,] "Because I could not stop for Death -He kindly stopped for me
-The Carriage held but just Ourselves -and Immortality"
my workspace is as follows
sessionInfo() R version 3.4.2 (2017-09-28) Platform: x86_64-apple-darwin15.6.0 (64-bit) Running under: macOS High Sierra
10.13.1
Matrix products: default BLAS:
/System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
LAPACK:
/Library/Frameworks/R.framework/Versions/3.4/Resources/lib/libRlapack.dylib
locale: [1]
en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
attached base packages: [1] stats graphics grDevices utils
datasets methods base
other attached packages: [1] quanteda_0.99.12 koRpus_0.10-2
data.table_1.10.4-3 scales_0.5.0 [5] purrr_0.2.4
readr_1.1.1 tidyr_0.7.2 tibble_1.3.4 [9]
tidyverse_1.1.1 gutenbergr_0.1.3 ggplot2_2.2.1
stringr_1.2.0 [13] dplyr_0.7.4 janeaustenr_0.1.5
tidytext_0.1.4
loaded via a namespace (and not attached): [1] reshape2_1.4.2
haven_1.1.0 lattice_0.20-35 colorspace_1.3-2 [5]
htmltools_0.3.6 SnowballC_0.5.1 yaml_2.1.14
rlang_0.1.2 [9] foreign_0.8-69 glue_1.2.0
modelr_0.1.1 readxl_1.0.0 [13] bindrcpp_0.2
bindr_0.1 plyr_1.8.4 munsell_0.4.3 [17]
gtable_0.2.0 cellranger_1.1.0 rvest_0.3.2
psych_1.7.8 [21] evaluate_0.10.1 knitr_1.17
forcats_0.2.0 parallel_3.4.2 [25] broom_0.4.2
tokenizers_0.1.4 Rcpp_0.12.13 backports_1.1.1 [29]
RcppParallel_4.3.20 jsonlite_1.5 fastmatch_1.1-0
mnormt_1.5-5 [33] hms_0.3 digest_0.6.12
stringi_1.1.5 bookdown_0.5 [37] grid_3.4.2
rprojroot_1.2 tools_3.4.2 magrittr_1.5 [41]
lazyeval_0.2.1 pkgconfig_2.0.1 Matrix_1.2-11 xml2_1.1.1
[45] lubridate_1.7.1 assertthat_0.2.0 rmarkdown_1.6
httr_1.3.1 [49] R6_2.2.2 nlme_3.1-131
compiler_3.4.2

ggparagraph is not displayed

I have problem with the function ggparagraph() from ggpubr package. Its content is not drawing. If I use the function:
example(ggparagraph)
The plot is drawing without any problem, but the "ggparagraph area" (below plot) is empty. The same problem is if I use custom scripts, where the ggparagraph is applied.
R does not print any error message. Where can be the problem?
My session info is:
sessionInfo()
R version 3.4.1 (2017-06-30)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 10 x64 (build 15063)
Matrix products: default
locale:
1 LC_COLLATE=Slovak_Slovakia.1250 LC_CTYPE=Slovak_Slovakia.1250
[3] LC_MONETARY=Slovak_Slovakia.1250 LC_NUMERIC=C
[5] LC_TIME=Slovak_Slovakia.1250
attached base packages:
1 stats graphics grDevices utils datasets methods base
other attached packages:
1 ggpubr_0.1.4 magrittr_1.5 gridExtra_2.2.1 dplyr_0.7.2
[5] purrr_0.2.3 readr_1.1.1 tidyr_0.6.3 tibble_1.3.3
[9] ggplot2_2.2.1 tidyverse_1.1.1
loaded via a namespace (and not attached):
1 Rcpp_0.12.12 cellranger_1.1.0 compiler_3.4.1 plyr_1.8.4
[5] bindr_0.1 forcats_0.2.0 tools_3.4.1 jsonlite_1.5
[9] lubridate_1.6.0 nlme_3.1-131 gtable_0.2.0 lattice_0.20-35
[13] pkgconfig_2.0.1 rlang_0.1.2 psych_1.7.5 parallel_3.4.1
[17] haven_1.1.0 bindrcpp_0.2 xml2_1.1.1 stringr_1.2.0
[21] httr_1.2.1 hms_0.3 cowplot_0.8.0 grid_3.4.1
[25] glue_1.1.1 R6_2.2.2 readxl_1.0.0 foreign_0.8-69
[29] reshape2_1.4.2 modelr_0.1.1 scales_0.4.1 rvest_0.3.2
[33] assertthat_0.2.0 mnormt_1.5-5 colorspace_1.3-2 labeling_0.3
[37] stringi_1.1.5 lazyeval_0.2.0 munsell_0.4.3 broom_0.4.2
Finally, I don't find where is the problem, but if I install this package from github (instead of CRAN), all paragraphs are rendered correctly.

Resources