This might be very trivial but I could not find an easy solution anywhere.

Question

0

Asked: June 19, 20262026-06-19T04:07:17+00:00 2026-06-19T04:07:17+00:00

This might be very trivial but I could not find an easy solution anywhere.

0

This might be very trivial but I could not find an easy solution anywhere. I am trying to create a script in R to count entries in one column that belong to a one of 3 categories specified another column. I have a list of clinical patients with ID numbers (more than one entry for the same ID) that have been seen by two services (a or b). I need to know how many ID have been seen by service a and by service b and service c, but counting repeated visits by a service only once (so basically the number of patients that have used each service at least once) – hope this makes sense, here is an example to explain.

Example:



     ID    Category

     A001  a

     A002  a

     A002  a

     A002  b

     A003  b

     A003  b

     A005  c

     A001  a

     A004  b

     A004  b

     A006  c

     A006  a

Output should be something like:
     a=3
     b=3
     c=2

This is what I have done, but I am quite stuck, and this might not be good at all!
 DataString<- matrix(nrow=dim(refnum)[1], ncol=1)
 for (i in 1:dim(refnum)[1]){
   DataString[i,1]<- paste(refnum[i,], collapse = '')
 }

 #generate vector of unique strings
 uniqueID<- unique(DataString)

 #create new matrix to store new IDs
 newID<- matrix(nrow=dim(data)[1], ncol=1)

 #initiate index n
 n<-0
 #loop through unique strings
 for (i in 1:dim(refnum)[1]){
   #increment n by 1 for each increment through unique strings
   n<- n+1
   #loop through data rows
   for (j in 1:dim(data)[1]){    
     #find matches with string i
     index<- which(DataString == uniqueID[i,1])
     #assign new ID to matching rows
     newID[index,1]<- n
   }
 }

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-19T04:07:19+00:00

Editorial Team

2026-06-19T04:07:19+00:00Added an answer on June 19, 2026 at 4:07 am

One of the many solutions:

table(df[!duplicated(df), "Category"])

# a b c 
# 3 3 2

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

This might be very trivial but I could not find an easy solution anywhere.

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply