Given data s<-c(1,0,0,0,1,0,0,0,0,0,1,1,1,0,0) I can count 1s and 0s with table or ftable ftable(s,row.vars

Question

0

Asked: May 13, 20262026-05-13T19:14:24+00:00 2026-05-13T19:14:24+00:00

Given data s<-c(1,0,0,0,1,0,0,0,0,0,1,1,1,0,0) I can count 1s and 0s with table or ftable ftable(s,row.vars

0

Given data

s<-c(1,0,0,0,1,0,0,0,0,0,1,1,1,0,0)

I can count 1s and 0s with table or ftable

ftable(s,row.vars =1:1)

and the totals of 11s,01s,10s,00s occurred in s with

table(s[-length(s)],s[-1]).

What would be the clever way to count occurrences of 111s, 011s, …, 100s, 000s? Ideally, I want a table of counts x like

   0 1
11 x x
01 x x
10 x x
00 x x

Is there a general way to compute the total occurrences for all possible sub-sequences of length k=1,2,3,4, … occurred in data?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-13T19:14:24+00:00

Well, it seems like you would first need to generate n-tuples from your vector. The following function should accomplish that:

makeTuples <- function( x, n ){

  # Very inefficient way to loop... but what the heck
  tuples <- list()

  for( i in 1:n ){

    tuples[[i]] <- x[i:(length(x)-n+i)]

  }

  return(tuples)

}

Then you could feed the results of makeTuples() to table() using do.call():

do.call( table, makeTuples(s,3) )

, ,  = 0


    0 1
  0 4 1
  1 3 1

, ,  = 1


    0 1
  0 2 1
  1 0 1

This works because the makeTuples() function returns the tuples as a list of lists. The output isn’t quite as nice as you wanted, but you could write a function to reformat, say:

To:

     0 1
  00 4 1
  01 3 1

It would require looping over the outer n-2 dimensions of the n-dimensional array returned by table, creating row names and concatenating things together.

Update

So, I was just sitting in a Stochastic processes class when I figured out a more or less straight-forward way to produce the output you want without trying to unwind the output of table(). First you will need a function that generates all possible permutations of n selections from your population. The generation of permutations can be done with expand.grid(), but it needs a little sugar-coating:

permute <- function( population, n ){

  permutations <- do.call( expand.grid, rep( list(population), n ) )

  permutations <- apply( permutations, 1, paste, collapse = '' )

  return( permutations )

}

The basic idea is to iterate over the list of permutations and count the number of tuples that match the given permutation. Since you want the results split out into a table, we should select a permutation of n-1 elements from the population and let the last position form the columns of the table. Here’s a function that takes a permutation of size n-1, a list of tuples, and the population the tuples were drawn from and produces a named vector of match counts:

countFrequency <- function(permutation,tuples,population){

  permutations <- paste( permutation, population, sep = '' )

  # Inner lapply applies the equality operator `==` to each
  # permutation and returns a list of TRUE/FALSE vectors.
  # Outer lapply sums the number of TRUE values in each vector. 
  frequencies <- lapply(lapply(permutations,`==`,tuples),sum)

  names( frequencies ) <- as.character( population )

  return( unlist(frequencies) )

}

Finally, all three functions can be combined into a bigger function that takes a vector, splits it into n-tuples and returns a frequency table. The final aggregation operation is done using ldply() from Hadley Wickham’s plyr package as it does a nice job of preserving information such as which permutation corresponds to which row of output matches:

permutationFrequency <- function( vector, n, population = unique( vector ) ){

  # Split the vector into tuples.
  tuples <- makeTuples( vector, n )

  # Coerce and compact the tuples to a vector of strings.
  tuples <- do.call(cbind,tuples)
  tuples <- apply( tuples, 1, paste, collapse = '' )

  # Generate permutations of n-1 elements from the population.
  # Turn into a named list for ldply() to work it's magic.
  permutations <- permute( population, n-1 )
  names( permutations ) <- permutations

  frequencies <- ldply( permutations, countFrequency,
    tuples = tuples, population = population )

  return( frequencies )

}

And there you go:

require( plyr )
permutationFrequency( s, 2 )
  .id 1 0
1   1 2 3
2   0 2 7

permutationFrequency( s, 3 )
  .id 1 0
1  11 1 1
2  01 1 1
3  10 0 3
4  00 2 4

permutationFrequency( s, 4 )
  .id 1 0
1 111 0 1
2 011 1 0
3 101 0 0
4 001 1 1
5 110 0 1
6 010 0 1
7 100 0 2
8 000 2 2

permutationFrequency( sample( -1:1, 10, replace = T ), 2 )
  .id 1 -1 0
1   1 1  2 0
2  -1 0  1 2
3   0 1  0 2

Apologies to my stochastic processes teacher, but functional programming problems in R were just more interesting than the Gambler’s Ruin today…

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

Given data s<-c(1,0,0,0,1,0,0,0,0,0,1,1,1,0,0) I can count 1s and 0s with table or ftable ftable(s,row.vars

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply