In Settings -> Reading, the Site Visibility options are quite limited: I can allow either all search engines to index my blog or none. I registered my blog on Google, but I don't want certain "name search engines" to index it, because most of them use illegal methods to get a better page rank. They claim to be search engines, but they copy data and keep obsolete copies in their caches. They don't use their own web crawlers; they reuse the search engine result pages of other engines (they behave like meta-engines). I have a hard time when I ask them to stop using my personal data. They claim to use only publicly available information, but lots of people have complained that their private data from LinkedIn is used too. I have never registered on any of these name search engines. They use my copyrighted content without my prior consent, and they display my content licensed under CC NC (non-commercial) on pages with ads. I succeeded in forcing some of them to stop indexing some of my pages (123People, Yatedo, Webidoo), but Yasni.fr still annoys me a lot. It even uses blind texts, and it refuses to remove defamatory and insulting content from its index.
These name search engines don't provide any valuable service, and they are a huge source of problems for lots of webmasters. Could you please find a way to let us disallow, or at least discourage, these fake engines from indexing blogs hosted on WordPress.com? Could you also strongly "encourage" Yasni GmbH to provide a means of identifying its web crawlers, or a simple setting to prevent it from scanning blogs on WordPress.com?
I have written more than 200 bilingual (French and English) articles with tons of pictures, videos, drawings, etc. Yasni doesn't respect my author's rights (I don't like the word "copyright"). I have spent more than 3 years improving my blog and I'm very satisfied with WordPress.com, but how can I go on publishing here if private corporations can steal my creations to make money? I have already contacted Yasni 4 times since 2010. Its support always uses wrong arguments to defend its service: it violates some terms of the Berne Convention, and it claims that as soon as content is public, the author loses all rights to it, which is plain wrong. I'm going to contact the CNIL (the French National Commission on Informatics and Liberty) and Google Inc. Your help is more than welcome; I'm almost out of options.
The blog I need help with is gouessej.wordpress.com.