I’m in a situation where I’m given a character string and need to determine

Question

0

Asked: June 3, 20262026-06-03T00:41:36+00:00 2026-06-03T00:41:36+00:00

I’m in a situation where I’m given a character string and need to determine

0

I’m in a situation where I’m given a character string and need to determine if the language of the string is Spanish or English. I plan on parsing for stop words – Spanish (`de, es, si, y”) vs English (‘of’, ‘is’, ‘if’, ‘and’)? If there more Spanish occurrences than English occurrences, then, I conclude the page is Spanish.

Are there any Ruby snippets already available to do this? If not, what would be good method for string parsing or regex to do this?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-03T00:41:39+00:00

If you have a string that contains a sentence (or a series of words, at least), you can use string.split(' ') to split the string into an array of words. From there, you can use .each to iterate through the list and process each word. For example:

def detect_language(sentence)
    english_count = 0
    spanish_count = 0
    sentence.split(' ').each {|word|
        if looks_like_english(word)
            english_count += 1
        elsif looks_like_spanish(word)
            spanish_count += 1
        end
    }

    retval = ["spanish", "unknown", "english"]
    retval[(english_count <=> spanish_count) + 1]
end

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m in a situation where I’m given a character string and need to determine

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply