Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 4036852
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 20, 20262026-05-20T12:16:28+00:00 2026-05-20T12:16:28+00:00

I would like to aggregate our IIS logs and be able to quickly perform

  • 0

I would like to aggregate our IIS logs and be able to quickly perform simple queries against them (e.g. how many times as page x hit in the past month etc.)

I’d like aggregrate this data (maybe down to 10 minutes or so granularity on the time dimension) into an SSAS cube.

We already import the logs via SSIS into tables. The tables are becoming very large, and I’d like to start deleting older data (say more than 3 months ago), whilst keeping the history in the cube (so I can still query over longer than 3 months). Is this a reasonable thing to do – clearly I’d not be able to rebuild my cube easily should I wish to change it… and I guess I’d need to start backing up the cube as if it was a database?

Currently I don’t have a PK on the data – Would it be better to add an identity column or create a composite PK out of date,time and url?

Any feedback from someone that has successfully implemented this would be fantastic 🙂

Thanks!

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-20T12:16:29+00:00Added an answer on May 20, 2026 at 12:16 pm

    I haven’t done exactly this, but I’ll give you opinions on as much as I can:

    The tables becoming large, why is this a problem – Is it storage space, or speed?

    If speed, consider using table partitioning to split your large tables. You can partition them by date range, then switch the partitions into another table (Reducing the size of the original), this is a pure metadata operation and will be instant. SSAS can then use a view that unions both tables when it processes, if you ever need to rebuild you can.

    If storage space, have you looked at compression in SQL Server (Available in 2008, not sure what version you’re on?).

    Personally I just wouldn’t be happy without the ability to rebuild the cube – Also remember that the cube makes a copy of the source data (Or rather the parts it uses as per the DSV) so you may not save as much as you think if you delete old data & treat the cube as the ‘storage device’. Does your cube only use a portion of the tables? What size is it compared to the underlying data?

    A PK on the data is not strictly required for SSAS – BUT – I always use them, primarily to prevent duplicate loads (I also load by time – Check data is newer than last loaded) but it’s good to have a PK constraint preventing duplicate loads.

    For your PK, Date, Time, URL sounds good, but depends on how busy your site is. Your example would not allow two people to view the same URL at the same second. Could you add IP Address to the PK? What if a visitor refreshed quickly? Would/Could you treat that as a duplicate, and remove it in the SSIS dataflow?

    Good luck, let me know if you have any questions on what I’ve said.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I have a database with many rows and I would like to consecutively aggregate
I would like to perform an aggregate query on a DataTable to create another
I have objects in a tree structure, I would like to aggregate status information
Would like to be able to set colors of headings and such, different font
I would like to do aggregate calculations based on month for a datetime field.
I have 2 seperate select statements, using aggregate functions in each. I would like
Does HQL support conditional expressions in aggregate functions? I would like to do something
I would like to aggregate a data.frame by an identifier variable called ensg .
I have a simple SQL statement like the following and I would like to
My needs I would like our in house, standard product to fire different events

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.