In R, I have data a vector of integers. run <- sample.int(9, 1000, replace=T)

Question

0

Asked: June 15, 20262026-06-15T17:48:32+00:00 2026-06-15T17:48:32+00:00

In R, I have data a vector of integers. run <- sample.int(9, 1000, replace=T)

0

In R, I have data a vector of integers.

run <- sample.int(9, 1000, replace=T)
run[sample.int(1000, 100)] <- NA

If at least one of the following patterns, c(1, x, 1, y) or c(x, 1, y, 1) where x and y are either whole numbers or NA, is present, I would like to print out the start index of each pattern and update a count variable for each instance of a pattern. What is the most efficient way of doing this?

I was thinking of using the rle function and testing for every 4 consecutive values for a length of 1, and then testing whether they conform to one of the patterns. However, I am having problems with NAs with this approach since each NA is treated separately. Perhaps there is a better way to do this.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-15T17:48:34+00:00

Taking your usage of sample.int as implying your vector only contains values from 1:9 and NA, here’s a regular expressions approach:

run <- c(1, NA, 1, 3, 1, 1, NA, NA, NA, 1)
run[is.na(run)] <- 0
pat1 <- "(?=1[0-9]1[0-9])" # using a lookahead assertion around the pattern is a way to allow overlapping matches
pat1.idxs <- unlist(gregexpr(pat1, paste(run, collapse=''), perl=TRUE))
pat1.idxs
# match indexes
# [1] 1 3
length(pat1.idxs)
# counts
# [1] 2

Then you would do second pattern similarly.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

In R, I have data a vector of integers. run <- sample.int(9, 1000, replace=T)

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply