PHPSpreadsheet load CSV setting default data format - phpexcel

I am loading a CSV file via PhpOffice\PhpSpreadsheet\Reader\Csv. My first column contains a large number (15191100011) that I want loaded with the data format Text rather than General, because when it is loaded as General it displays as 1.52E+10.
When you import manually via Excel | File | Import, the last step asks for the column data format, which defaults to General; there I select Text for the first column.

Related

Concatenate numbers into strings R programming

I have two columns which contain numeric values. I would like to create a third by concatenating them as a string. I do that using the paste function, but on exporting the data to CSV, the third column gets converted into a date.
Desired output (Column C):
A B C
2 3 2-3
4 5 4-5
A and B are contained in a dataset called concat.
The code written so far is as follows:
concat$C <- paste(concat$A,concat$B, sep="-", collapse = NULL)
This shows the desired output on screen, but on writing to CSV, the values in column C change to date format.
As the comments have pointed out, this is a result of the way Excel (or another application) interprets column formats. Similar problems happen if you want to export numeric columns with leading 0s, open US-format CSV files in countries like Germany, etc.
The easiest solution to all these problems is to not open .csv in Excel directly.
Instead, open a new, empty Excel workbook and use the Import Assistant in the Data tab. This will allow you to import CSV or any other separated-text format and control the column formats before importing!
Be aware that simply opening .csv, .tsv, etc. in Excel and then saving in the original file format will overwrite all data with whatever data format Excel assumed! So always use the import assistant.
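To check that the conversion happens in Excel and not in R, a minimal sketch like the following can help. It rebuilds the example data, writes it to a temporary file (the path from tempfile() is only an illustration, not part of the question) and reads it back as text:

```r
# Rebuild the example data from the question.
concat <- data.frame(A = c(2, 4), B = c(3, 5))
concat$C <- paste(concat$A, concat$B, sep = "-")

# Write to a temporary CSV file (hypothetical location, for illustration only).
csv_path <- tempfile(fileext = ".csv")
write.csv(concat, csv_path, row.names = FALSE)

# Inspect the raw file: column C is stored as quoted strings, not dates.
raw_lines <- readLines(csv_path)
print(raw_lines)

# Reading every column back as character shows the values are unchanged.
check <- read.csv(csv_path, colClasses = "character")
print(check$C)
```

If the values look correct here, the file itself is fine and only the way it is opened needs to change.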

Read dataset.train and dataset.test in r

I am doing a project on a high-dimensional data set. The data set is from http://archive.ics.uci.edu/; it can also be found on GitHub (https://github.com/minghust/MaliciousExeDetect/tree/master/TrainData).
The files are called "dataset.train","tst.test". I want to read them in R.
My question is whether .train and .test are actual file formats. They are not CSV or TXT files. How can I open them and import them into R?
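A file extension does not by itself determine the format, so one way to approach this is to look at the first lines as plain text and then parse accordingly. The sketch below is only an illustration: the actual layout of dataset.train is not known here, so a small comma-separated stand-in file is created, and the separator is an assumption.

```r
# Hypothetical stand-in file: the real layout of dataset.train is not known
# here, so a small comma-separated sample is written to a temporary path.
train_path <- file.path(tempdir(), "dataset.train")
writeLines(c("1,0,0.5", "0,1,0.25"), train_path)

# Look at the first lines as plain text to find the delimiter and any header.
head_lines <- readLines(train_path, n = 5)
print(head_lines)

# Once the delimiter is known, read.table parses the file whatever its extension.
train <- read.table(train_path, sep = ",", header = FALSE)
print(dim(train))
```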

excel spreadsheet import/export issue in R

I have an Excel file with the first column completely blank. When I read it into R using read.xls, the first column does not appear. However, when I export the data set, that column reappears, but with numbers in it corresponding to the row.
My issue is that I need my exported xls sheet to retain that first blank column (including no header/variable name).
How do I retain that column or make sure the exported spreadsheet has it?

How to split one column containing several values so each column only contains one value?

The starting situation is as follows:
I've got a CSV file with roughly 3000 rows, but only 1 column. Each of the rows contains several values.
Now I want to assign only one value per column.
How do I manage to do that?
Convert the file into txt format and then open the data using MS Excel. Don't open the file directly; open it using the Open option in the File menu. When you do this, a text wizard will appear. You can then split your data using delimiters such as commas or spaces to form multiple columns. Once you are done, save the file in CSV format.
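If R is available, the same split can be done without the Excel text wizard. This is an alternative to the answer above, not the method it describes. The sample rows and the comma delimiter are assumptions, since the question does not state how the values are separated:

```r
# Hypothetical stand-in for the single-column file: each row holds several
# comma-separated values (the real delimiter is not stated in the question).
single_col <- c("a,1,x", "b,2,y", "c,3,z")

# read.csv can parse the strings directly via the text argument and
# split them on the delimiter given in sep.
split_data <- read.csv(text = single_col, header = FALSE, sep = ",",
                       stringsAsFactors = FALSE)
print(split_data)
```

For a real file, the lines could first be read with readLines() and passed in the same way, with sep changed to match the actual delimiter.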

read a selected column from multiple csv files and combine into one large file in R

Hi,
My task is to read selected columns from over 100 identically formatted .csv files in a folder, and to cbind them into one large file using R. I have attached a screenshot of a sample data file to this question.
This is the code I'm using:
filenames <- list.files(path="G:\\2014-02-04")
mydata <- do.call("cbind",lapply(filenames,read.csv,skip=12))
My problem is that the first column is the same in every .csv file, so my code creates a big file with duplicate first columns. How can I create a big file with just a single column A (no duplicates)? I would also like to name the second column read from each .csv file using the value of cell B7, which is the specific timestamp of each .csv file.
Can someone help me on this?
Thanks.
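One possible approach, shown as a sketch under assumptions: the sample files below are made up (12 metadata lines with the timestamp in cell B7, then a header row and the data), and the helper names read_stamp and read_second are hypothetical. The shared first column is read once, and only the second column of each file is appended under its timestamp.

```r
# Build two small sample files in a temporary folder (hypothetical layout:
# 12 metadata lines, with the timestamp in cell B7, followed by the data).
folder <- tempfile("csv_demo")
dir.create(folder)
make_file <- function(name, stamp, values) {
  meta <- rep("meta,info", 12)
  meta[7] <- paste0("Time,", stamp)
  body <- c("x,y", paste(1:3, values, sep = ","))
  writeLines(c(meta, body), file.path(folder, name))
}
make_file("a.csv", "2014-02-04 10:00", c(10, 20, 30))
make_file("b.csv", "2014-02-04 11:00", c(40, 50, 60))

# full.names = TRUE returns paths that work from any working directory.
filenames <- list.files(folder, pattern = "\\.csv$", full.names = TRUE)

# Read cell B7 (row 7, column 2) of one file: the timestamp.
read_stamp <- function(f) {
  as.character(read.csv(f, header = FALSE, skip = 6, nrows = 1,
                        stringsAsFactors = FALSE)[1, 2])
}

# Read only the second data column of one file.
read_second <- function(f) read.csv(f, skip = 12)[[2]]

# Take the shared first column once, then append one column per file,
# named after that file's timestamp.
mydata <- data.frame(A = read.csv(filenames[1], skip = 12)[[1]])
for (f in filenames) {
  mydata[[read_stamp(f)]] <- read_second(f)
}
print(mydata)
```

This assumes every file has the same number of rows in the same order; if that is not guaranteed, merging on the first column would be safer than appending columns.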
