feat: support multiple languages #11

Open
opened 2021-03-27 19:02:23 +01:00 by ntimo · 9 comments
ntimo commented 2021-03-27 19:02:23 +01:00 (Migrated from github.com)

Hello,
first off I would like to thank you for making this awesome web app.

I would like to suggest adding support for multiple languages. Maybe different languages could then be enabled per website.

Thanks,
Timo

Hello, first off I would like to thank you for making this awesome web app. I would like to suggest adding support for multiple languages. Maybe different languages could then be enabled per website. Thanks, Timo
omohokcoj commented 2021-04-03 09:17:08 +02:00 (Migrated from github.com)

Hi @ntimo, thanks for the feedback!

SiteInspector uses https://github.com/omohokcoj/ruby-spellchecker/ library for English spellchecking - that library contains a large list of common English spelling mistakes. Adding spellchecking for other languages is quite hard and requires a decent knowledge of the language - unfortunately, I don't know other languages well enough to be able to build a spellchecker for them.

So I think adding new languages is not possible at the moment unless there are some native speakers willing to work on adding new languages directly into the ruby-spellchecker library.

Hi @ntimo, thanks for the feedback! SiteInspector uses https://github.com/omohokcoj/ruby-spellchecker/ library for English spellchecking - that library contains a large list of common English spelling mistakes. Adding spellchecking for other languages is quite hard and requires a decent knowledge of the language - unfortunately, I don't know other languages well enough to be able to build a spellchecker for them. So I think adding new languages is not possible at the moment unless there are some native speakers willing to work on adding new languages directly into the ruby-spellchecker library.
kosssi commented 2021-04-06 16:36:07 +02:00 (Migrated from github.com)

@omohokcoj Maybe this repository can help for this question https://github.com/ONLYOFFICE/dictionaries ;)

Or http://hunspell.github.io/ with Ruby support (https://rubygems.org/gems/hunspell-ffi)

@omohokcoj Maybe this repository can help for this question https://github.com/ONLYOFFICE/dictionaries ;) Or http://hunspell.github.io/ with Ruby support (https://rubygems.org/gems/hunspell-ffi)
omohokcoj commented 2021-04-06 20:05:22 +02:00 (Migrated from github.com)

@kosssi hunspell dictionary spellchecker treats all unknown words as mistakes and it produces too many false positives when checking website pages. Also, I used to try languagetool and it produced too many false positives as well - that's how I ended up developing a ruby spellchecker library from scratch :)

@kosssi hunspell dictionary spellchecker treats all unknown words as mistakes and it produces too many false positives when checking website pages. Also, I used to try languagetool and it produced too many false positives as well - that's how I ended up developing a ruby spellchecker library from scratch :)
tldev-de commented 2022-01-03 22:11:16 +01:00 (Migrated from github.com)

Perhaps it would be a good solution to be able to configure the language per website. In addition to the languages available in Hunspell, one could then also offer the error detection you developed as less noisy optimization for english language.

I would love to use this tool, but most (for me important) websites are only available in german language, which makes siteinspector a lot less effective!

Perhaps it would be a good solution to be able to configure the language per website. In addition to the languages available in Hunspell, one could then also offer the error detection you developed as less noisy optimization for english language. I would love to use this tool, but most (for me important) websites are only available in german language, which makes siteinspector a lot less effective!
RafaelKr commented 2022-03-07 17:10:46 +01:00 (Migrated from github.com)

Hi @omohokcoj, I just found your tool and it's awesome!

I have a multi-lingual site and it would be great if I could configure that spell checking should only be applied to pages which have set a lang="en-*" (lang="en-US", lang="en-GB", ...) attribute on the html-tag.
So I would have spell- and grammar-checking only for the english area of the page and broken link checking for english and others.

Hi @omohokcoj, I just found your tool and it's awesome! I have a multi-lingual site and it would be great if I could configure that spell checking should only be applied to pages which have set a `lang="en-*"` (`lang="en-US"`, `lang="en-GB"`, ...) attribute on the html-tag. So I would have spell- and grammar-checking only for the english area of the page and broken link checking for english and others.
brunoamaral commented 2023-03-01 18:53:44 +01:00 (Migrated from github.com)

This would be useful for Portuguese as well, so I am just raising my hand up in case I can help move the multilanguage support forward.

This would be useful for Portuguese as well, so I am just raising my hand up in case I can help move the multilanguage support forward.
mediaarchitekt commented 2023-03-08 17:28:02 +01:00 (Migrated from github.com)

Hi,

thank you very much for this great application. There is nothing else comparable to this.

Only one request: if multi-language support is not so easy to implement, it would be helpful to completely disable the reference to spell checking and grammar in the reports. So it is written that there are no mistakes, although grammar and spelling were just not checked. This is misleading and raises expectations to viewers, which cannot be met for all languages except English.

Have a good time
Matthias

Hi, thank you very much for this great application. There is nothing else comparable to this. Only one request: if multi-language support is not so easy to implement, it would be helpful to completely disable the reference to spell checking and grammar in the reports. So it is written that there are no mistakes, although grammar and spelling were just not checked. This is misleading and raises expectations to viewers, which cannot be met for all languages except English. Have a good time Matthias
ecollart2 commented 2023-11-15 12:09:30 +01:00 (Migrated from github.com)

Hello,

that's also a bit frustrating for me having my sites in French; I cannot use your spelling/grammar check but the broken link check part is already a treasure !

Eric Collart

Hello, that's also a bit frustrating for me having my sites in French; I cannot use your spelling/grammar check but the broken link check part is already a treasure ! Eric Collart
WeterPeter commented 2024-03-05 07:09:07 +01:00 (Migrated from github.com)

Still no multi language support? Planned?

Still no multi language support? Planned?
dan added this to the siteinspector project 2025-10-26 12:58:10 +01:00
dan changed title from Multi lang support to feat: support multiple languages 2025-10-26 13:00:03 +01:00
This discussion has been locked. Commenting is limited to contributors.
No description provided.