change data in column [closed] - r

It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 10 years ago.
I have boxscore data from the NFL and some of the data is obviously incorrect. For example for some games the number of sacks is negative, which is impossible. This column is named SackNumOff. How do I change any negative values in this column to zero?

Something like this:
dat$columnname[dat$columnname < 0] = 0
Replaces all negative numbers by 0. The idea is that you can use a subset [] both to extract a subset and assign values to a subset.

Related

Deleting a Row in R based on value in column [closed]

It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 9 years ago.
Have a dataframe in R, I want to delete all rows in that dataframe where column X has values >100%. Whats the best way to do this?
Appreciate the help.
If your column X contains numbers (which I'm pretty sure it does although your use of % symbols gives bit different impression), then you can select the rows i where X[i]<100 like this:
datasetnew <- dataset[dataset$X<=100,]
But if you really have percentages in the column, i.e. values in X are something like "10%","23%","103%", then you need to remove the % first, for example using the gsub function:
datasetnew <- dataset[as.numeric(gsub(dataset$X,"%",""))<=100,]

Applying Loops on Levels [closed]

It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 10 years ago.
I have a huge csv file of sports (EPL) data which encompasses player performance in every game for their respective teams. I would like to run a loop to compare the amount of times a team has scored first in a match (the data is called First.Goal).
I know how to calculate them individually, e.g for Liverpool from a csv called Prem1112:
Prem<-read.csv("Prem1112.csv")
sum(subset(Prem,Team=='Liverpool',First.Goal))
Ideally I'd like to run the loop so I wouldn't have to calculate all 20 teams individually. Any ideas?
What about this:
aggregate(First.Goal ~ Team, Prem, sum)

Abbreviating variables in R [closed]

It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 10 years ago.
I am using a data set called Forbes2000 which is provided by the package HSAUR. I am able to plot the data but not able to abbreviate each point with the corresponding country name. Here is the code I have tried:
Forbes2000top50ccompanies <- head(Forbes2000[order(Forbes2000$profits, decreasing= T),], n = 50)
plot(sales ~ assets,data=Forbes2000top50ccompanies)
This will give you labels that are the first 4 letters of the country names and make them smaller than would be the default:
with(Forbes2000top50ccompanies,
text(x=assets, y=sales,
labels=substr(Forbes2000top50ccompanies$country, 1, 4), cex=0.6) )

Replace contents of multiple cells in a dataframe [closed]

It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 10 years ago.
I need to replace the contents of multiple cells in a dataframe. Using mtcars as an example, how would I replace any cells which contain 1 with one in the vs column?
mtcars$vs[mtcars$vs == 1] <- "one"
or
mtcars[mtcars$vs == 1, "vs"] <- "one"
Something like this:
mtcars$vs[mtcars$vs == 1] <- "one"

How to get Standard Deviation of a single number in C# [closed]

It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 11 years ago.
How to calculate standard deviation of a single decimal value, in C#
I do it in SAS code as the following:
std(x y m ) = xSTD YSTD mSDT;
It is simply 0 see the wiki article.
Let X be the number
Calculate EV of [x] which is x
Subtract EV from each element in set which is [0]
Take the average of [0] which is 0
Take the square root of 0 which is 0

Resources