I have some labels and attributes from text. I am looking for patterns (combinations

Asked: May 20, 2026

I have some labels and attributes from text.
I am looking for patterns (combinations of key-value pairs that occur across many documents) of labels and attributes amongst these documents.

What kind of algorithm and tool should I be looking into? I want to score these patterns by relevance and importance, not just by string matching.

Any input would be great.
Thanks

1 Answer

Editorial Team · Answer 1 · 2026-05-20T10:47:51+00:00

If I understand your question correctly, you are talking about association rule mining. Example: attr1 == value1 ==> label == label1 (95% precision)

There are several algorithms for this; one of them is Apriori.
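To make the Apriori idea concrete, here is a minimal pure-Python sketch, not a tuned implementation. It assumes each document is a set of (attribute, value) pairs, with the label included as just another pair, and it grows frequent combinations level by level. The names `frequent_itemsets`, `confidence`, `min_support` and the toy documents are made up for illustration.

```python
from itertools import combinations

def frequent_itemsets(transactions, min_support):
    """Return {itemset: count} for every itemset found in >= min_support transactions."""
    transactions = [frozenset(t) for t in transactions]
    # Level 1: count single items.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    frequent = dict(current)
    k = 2
    while current:
        # Join frequent (k-1)-itemsets into candidate k-itemsets.
        prev = list(current)
        candidates = set()
        for i in range(len(prev)):
            for j in range(i + 1, len(prev)):
                union = prev[i] | prev[j]
                # Apriori pruning: every (k-1)-subset must itself be frequent.
                if len(union) == k and all(
                    frozenset(sub) in current for sub in combinations(union, k - 1)
                ):
                    candidates.add(union)
        # Keep only the candidates that are frequent enough.
        current = {}
        for cand in candidates:
            n = sum(1 for t in transactions if cand <= t)
            if n >= min_support:
                current[cand] = n
        frequent.update(current)
        k += 1
    return frequent

def confidence(itemsets, antecedent, consequent):
    """Confidence of the rule antecedent => consequent, from support counts."""
    antecedent = frozenset(antecedent)
    return itemsets[antecedent | frozenset(consequent)] / itemsets[antecedent]

# Toy documents (hypothetical data).
docs = [
    {("color", "red"), ("shape", "round"), ("label", "apple")},
    {("color", "red"), ("shape", "round"), ("label", "apple")},
    {("color", "red"), ("shape", "long"), ("label", "pepper")},
    {("color", "yellow"), ("shape", "long"), ("label", "banana")},
]
itemsets = frequent_itemsets(docs, min_support=2)
rule_conf = confidence(itemsets, {("color", "red")}, {("label", "apple")})
```

In association rule terms, the "precision" of a rule like the one above is its confidence: the support of the whole combination divided by the support of the left-hand side. For real data an existing implementation (for example in Weka) is likely a better choice than this sketch.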

The second interpretation of your question is feature selection, i.e. selecting the attributes that have the most impact on label prediction. For that you can look at information gain or chi^2 selection. All of this is available in Weka (www.cs.waikato.ac.nz/ml/weka).
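For the feature selection reading, information gain can be computed directly from counts. The following is a plain-Python sketch of the standard definition, H(label) - H(label | attribute), and is not Weka's implementation. The function names and the toy rows are hypothetical.

```python
import math
from collections import Counter, defaultdict

def entropy(labels):
    """Shannon entropy (in bits) of a list of labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, labels, attribute):
    """H(label) - H(label | attribute); rows are dicts mapping attribute -> value."""
    groups = defaultdict(list)
    for row, label in zip(rows, labels):
        groups[row.get(attribute)].append(label)
    conditional = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - conditional

# Toy data (hypothetical): color determines the label, size does not.
rows = [
    {"color": "red", "size": "small"},
    {"color": "red", "size": "large"},
    {"color": "green", "size": "small"},
    {"color": "green", "size": "large"},
]
labels = ["apple", "apple", "pear", "pear"]

# Rank attributes from most to least informative.
ranking = sorted(rows[0], key=lambda a: information_gain(rows, labels, a), reverse=True)
```

Attributes with a gain near zero tell you almost nothing about the label and can usually be dropped before looking for patterns.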

If you don't want to use such algorithms and would rather implement something yourself, the simplest implementation would look like:

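from collections import Counter

MIN_SUPPORT = 5  # example threshold: an (attribute, value) pair must occur in more documents than this

def score_patterns(docs):
    # docs: iterable of (attribute_dict, label) pairs, one per document
    pair_count, joint_count, label_count = Counter(), Counter(), Counter()
    for attrs, label in docs:
        label_count[label] += 1
        for a, value in attrs.items():
            pair_count[a, value] += 1
            joint_count[a, value, label] += 1
    patterns = []
    for (a, value, label), n in joint_count.items():
        if pair_count[a, value] > MIN_SUPPORT:  # not too rare
            prob = n / label_count[label]  # probability criterion; chi^2 works better
            patterns.append((prob, (a, value, label)))
    return sorted(patterns, key=lambda p: p[0], reverse=True)  # best patterns first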
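
The comment in the code above mentions chi^2 as a better criterion than the raw probability. As a hedged sketch of what that could look like, the function below computes the Pearson chi^2 statistic for a 2x2 table of "document has this attribute value" against "document has this label". The function name and the argument layout are assumptions for illustration.

```python
def chi_square_2x2(n11, n10, n01, n00):
    """Pearson chi^2 statistic for a 2x2 contingency table.

    n11: documents with the attribute value and the label
    n10: documents with the attribute value, without the label
    n01: documents without the attribute value, with the label
    n00: documents with neither
    """
    n = n11 + n10 + n01 + n00
    denom = (n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00)
    if denom == 0:
        return 0.0  # an empty row or column: no association can be measured
    return n * (n11 * n00 - n10 * n01) ** 2 / denom

perfect = chi_square_2x2(10, 0, 0, 10)      # value and label always co-occur
independent = chi_square_2x2(5, 5, 5, 5)    # value says nothing about the label
```

A larger statistic means a stronger association between the attribute value and the label, so it can replace `prob` as the sort key. Note that it does not distinguish positive from negative association.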
