I have a large (10000 X 5001) table representing 10000 samples and 5001 different

Question

0

Asked: June 5, 20262026-06-05T01:57:50+00:00 2026-06-05T01:57:50+00:00

I have a large (10000 X 5001) table representing 10000 samples and 5001 different

0

I have a large (10000 X 5001) table representing 10000 samples and 5001 different features of these samples. One of these features represents an output variable of each sample. In other words, I have 5000 input variables and one output variable for each sample.

I know that most of these inputs are irrelevant. Therefore, what I would like to do is determine the subset of input variables that predicts the output variable best. What is the best/simplest way to go about doing this in R?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-05T01:57:54+00:00

Editorial Team

2026-06-05T01:57:54+00:00Added an answer on June 5, 2026 at 1:57 am

You might want to check out Weka. In the Explorer load the data and then go to the Select attributes tab. There you will find several options to get the most informative attributes/features in your dataset.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have a large (10000 X 5001) table representing 10000 samples and 5001 different

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply