i have a set of login data for a user_id with a time stamp.

Question

0

Asked: June 1, 20262026-06-01T16:03:03+00:00 2026-06-01T16:03:03+00:00

i have a set of login data for a user_id with a time stamp.

0

i have a set of login data for a user_id with a time stamp.

a user could login multiple times but we need to return records at least an hour apart from one another, starting from the min record. the deduping has to happen at a user level (there can be multiple users)

for eg.

user1 2012-03-07 14:24:30.000
user1 2012-03-07 14:34:30.000
user1 2012-03-07 15:14:30.000
user1 2012-03-07 15:20:30.000
user1 2012-03-07 15:30:30.000
user1 2012-03-08 09:20:30.000
user1 2012-03-08 09:50:30.000
user1 2012-03-08 10:30:30.000
user2 2012-03-07 15:20:30.000

i would only want to see the following records

user1 2012-03-07 14:24:30.000
user1 2012-03-07 15:30:30.000
user1 2012-03-08 09:20:30.000
user1 2012-03-08 10:30:30.000
user2 2012-03-07 15:20:30.000

========================================================================

is there any way to do this in a clean way? we could do this recursively but i was hoping there might be a way to use row_number partition by.

any help is much appreciated!!

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-01T16:03:04+00:00

In Sql Server 2005 or newer this CTE will return table of LoginAt datetimes removing the ones less than hour apart from already selected LoginAts.

;with SkipHour(UserID, LoginAT, rn) as (
  select UserID, min(LoginAt), cast (1 as bigint)
    from LogTable
   group by UserID
  union all
  select SkipHour.UserID, LogTable.LoginAt,
         row_number() over (partition by SkipHour.UserID 
                            order by Logtable.LoginAt) rn
  from SkipHour
     inner join LogTable
        on LogTable.UserID = SkipHour.UserID
     where datediff(minute, SkipHour.LoginAt, LogTable.LoginAt) >= 60
     -- Only first rows from previous generation qualify to have children
        and rn = 1
)
select *
from SkipHour
where rn = 1
order by UserID, LoginAT

Crucial part is row_number(). As Sql Server does not allow neither aggregate functions nor top predicate, row_number() is the only way (IMO) to order loginAt datetimes and keep only first one.

Sql Fiddle playground is this way.

UPDATE:

Row numbers are applies for each generation individually. Extract from WITH common_table_expression (Transact-SQL):

Analytic and aggregate functions in the recursive part of the CTE are
applied to the set for the current recursion level and not to the set
for the CTE. Functions like ROW_NUMBER operate only on the subset of
data passed to them by the current recursion level and not the entire
set of data pased to the recursive part of the CTE. For more
information, see J. Using analytical functions in a recursive CTE.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

i have a set of login data for a user_id with a time stamp.

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply