Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 812633
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 15, 20262026-05-15T01:13:58+00:00 2026-05-15T01:13:58+00:00

I’m presently working with data in text files. I need to use algorithm called

  • 0

I’m presently working with data in text files. I need to use algorithm called principal component analysis so I have counted the words in text file which occurred more than one time in text file for eg

relation occured times
help occured 6 times
between OCCURED 3 TIMES
Analysis occurred 4 times
component occured 5 times
present occurred 6 times

So by taking count of above distinct words i need to form matrix of m x n. I am using C#.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-15T01:13:59+00:00Added an answer on May 15, 2026 at 1:13 am

    Several points:

    1. PCA is simple. However, you really need understand it before using it. This is a pity as it is not a black box tool, like a clustering algorithm.

    2. PCA is performed on the covariance matrix(that is X’*X, where each row of X is a text document). You can see that you cannot store a # of words by # of words matrix in memory. So for text data, you cannot directly use PCA. You need to use SVD and this technique is called latent semantic analysis. PCA and SVD are the same when the data are centered. In practice, data centering is not applied to text data as centering causes sparse into dense.

    3. Both PCA and SVD is easy, several lines of Matlab code. Only several lines of C# code if you have a linear algebra library for eigen-decomposition or SVD. The hard part as I noted is that you need to understand them.

    4. A more popular method to analyze text documents is probabilistic latent semantic analysis. Which is easy to understand and easy to code without using any matrix decompositions. Of course, you still need to learn some math.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Ask A Question

Stats

  • Questions 455k
  • Answers 455k
  • Best Answers 0
  • User 1
  • Popular
  • Answers
  • Editorial Team

    How to approach applying for a job at a company ...

    • 7 Answers
  • Editorial Team

    What is a programmer’s life like?

    • 5 Answers
  • Editorial Team

    How to handle personal stress caused by utterly incompetent and ...

    • 5 Answers
  • Editorial Team
    Editorial Team added an answer It sounds like you didn't set up your Audio Session… May 15, 2026 at 10:18 pm
  • Editorial Team
    Editorial Team added an answer Try setting the middle row to this... <RowDefinition Height="*" /> May 15, 2026 at 10:18 pm
  • Editorial Team
    Editorial Team added an answer I found the solution for this problem. I figured I… May 15, 2026 at 10:18 pm

Trending Tags

analytics british company computer developers django employee employer english facebook french google interview javascript language life php programmer programs salary

Top Members

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.