The Archive Base

Editorial Team
Asked: May 15, 2026 (2026-05-15T01:13:58+00:00)

I’m presently working with data in text files. I need to use algorithm called

I’m currently working with data in text files. I need to apply an algorithm called principal component analysis (PCA), so I have counted the words that occur more than once in a text file, e.g.:

relation occurred times
help occurred 6 times
between occurred 3 times
analysis occurred 4 times
component occurred 5 times
present occurred 6 times

Using the counts of the distinct words above, I need to form an m x n matrix. I am using C#.

1 Answer

  1. Editorial Team
    Added an answer on May 15, 2026 at 1:13 am

    Several points:

    1. PCA is simple, but you really need to understand it before using it. Unfortunately, it is not a black-box tool in the way a clustering algorithm is.

    2. PCA is performed on the covariance matrix, which is proportional to X’*X when each row of X is a text document and the columns of X are centered. That matrix has (number of words) by (number of words) entries, so for a realistic vocabulary you cannot store it in memory, and you cannot apply PCA to text data directly. Instead, you apply SVD to X itself; for text data this technique is called latent semantic analysis. PCA and SVD give the same result when the data are centered. In practice, text data is not centered, because centering turns a sparse matrix into a dense one.

    3. Both PCA and SVD are easy: several lines of Matlab code, and only several lines of C# code if you have a linear algebra library for eigendecomposition or SVD. The hard part, as I noted, is that you need to understand them.

    4. A more popular method for analyzing text documents is probabilistic latent semantic analysis, which is easy to understand and easy to code without any matrix decompositions. Of course, you still need to learn some math.
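
    To make points 2 and 3 concrete, here is a minimal C# sketch, not a definitive implementation. It assumes each text file is one document, builds the m x n matrix with one row per document and one column per distinct word, and then estimates the leading right singular vector of X by power iteration. Each step computes X'*(X*v), so the words-by-words matrix X'*X is never stored and no centering is applied. The names TermMatrixSketch, CountRepeatedWords, BuildMatrix and TopRightSingularVector are hypothetical, the tokenization on a fixed set of separators is an assumption, and the dense jagged array is only for illustration.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical helper class for illustration; not part of any library.
public static class TermMatrixSketch
{
    // Counts the words of one document and keeps only those seen more than once.
    // Assumption: words are separated by whitespace or basic punctuation.
    public static Dictionary<string, int> CountRepeatedWords(string text)
    {
        var counts = new Dictionary<string, int>();
        var separators = new[] { ' ', '\t', '\r', '\n', '.', ',', ';', ':', '!', '?' };
        foreach (var raw in text.Split(separators, StringSplitOptions.RemoveEmptyEntries))
        {
            var word = raw.ToLowerInvariant();
            counts[word] = counts.TryGetValue(word, out var seen) ? seen + 1 : 1;
        }
        return counts.Where(kv => kv.Value > 1)
                     .ToDictionary(kv => kv.Key, kv => kv.Value);
    }

    // Builds the m x n count matrix: row i is document i, column j is word j
    // of the sorted vocabulary. Words absent from a document stay at 0.
    public static double[][] BuildMatrix(
        IList<Dictionary<string, int>> docs, out string[] vocabulary)
    {
        vocabulary = docs.SelectMany(d => d.Keys)
                         .Distinct()
                         .OrderBy(w => w, StringComparer.Ordinal)
                         .ToArray();
        var x = new double[docs.Count][];
        for (int i = 0; i < docs.Count; i++)
        {
            x[i] = new double[vocabulary.Length];
            for (int j = 0; j < vocabulary.Length; j++)
            {
                if (docs[i].TryGetValue(vocabulary[j], out var count))
                    x[i][j] = count;
            }
        }
        return x;
    }

    // Power iteration for the leading right singular vector of X.
    // Each step computes X^T (X v) row by row, so X^T X is never formed.
    public static double[] TopRightSingularVector(double[][] x, int iterations = 100)
    {
        int n = x[0].Length;
        var v = Enumerable.Repeat(1.0 / Math.Sqrt(n), n).ToArray();
        for (int it = 0; it < iterations; it++)
        {
            var next = new double[n];
            foreach (var row in x)
            {
                double dot = 0.0;
                for (int j = 0; j < n; j++) dot += row[j] * v[j];    // (X v) for this row
                for (int j = 0; j < n; j++) next[j] += dot * row[j]; // add to X^T (X v)
            }
            double norm = Math.Sqrt(next.Sum(a => a * a));
            if (norm == 0.0) return v; // all-zero matrix: nothing to find
            for (int j = 0; j < n; j++) v[j] = next[j] / norm;
        }
        return v;
    }
}
```

    You would call CountRepeatedWords once per file, pass the resulting list to BuildMatrix, and then pass the matrix to TopRightSingularVector. For a real corpus you would more likely store X in a sparse format and use the SVD routine of a linear algebra library, as point 3 suggests; the power iteration above only recovers the first component.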
