I’ve got a column in a CSV file that looks like c(,1,1 1e-3) (i.e.

Question

0

Asked: June 8, 20262026-06-08T03:00:01+00:00 2026-06-08T03:00:01+00:00

I’ve got a column in a CSV file that looks like c(,1,1 1e-3) (i.e.

0

I’ve got a column in a CSV file that looks like c("","1","1 1e-3") (i.e. white space seperated). I’m trying to run through all values, taking the sum() of values where there is at least one value and returning NA otherwise.

My code currently does something like this:

x <- c("","1","1 2 3")
x2 <- as.numeric(rep(NA,length(x)))
for (i in 1:length(x)) {
  si <- scan(text=x[[i]],quiet=TRUE)
  if (length(si) > 0)
    x2[[i]] <- sum(si)
}

I’m struggling to make this fast; x is really a set of columns from a CSV file containing a few hundred thousand rows and thought it should be possible to do this in R.

(these are thinned samples from the posterior of a reversible jump MCMC algorithm, hence combining multiple values as the dimensionality changes throughout the file and I want useful columns).

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-08T03:00:04+00:00

This seems to perform a bit faster and may work for you.

#define a helper function
f <- function(x) sum(as.numeric(x))
unlist(lapply((strsplit(x3, " ")), f))
#-----
[1] 0 1 6

This will return a zero instead of NA, but maybe that isn’t a deal breaker for you?

Let’s see how this scales to a larger problem:

#set up variables
x3 <- rep(x, 1e5)
x4 <- as.numeric(rep(NA,length(x3)))
#initial approach
system.time(for (i in 1:length(x3)) {
  si <- scan(text=x3[[i]],quiet=TRUE)
  if (length(si) > 0)
    x4[[i]] <- sum(si)
})
#-----
   user  system elapsed 
   30.5     0.0    30.5 

#New approach:
system.time(unlist(lapply((strsplit(x3, " ")), f)))
#-----
   user  system elapsed 
   0.82    0.01    0.84

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’ve got a column in a CSV file that looks like c(,1,1 1e-3) (i.e.

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply