this post was submitted on 15 Jul 2023
13 points (100.0% liked)
Reddit Migration
### About Community
Tracking and helping #redditmigration to Kbin and the Fediverse. Say hello to the decentralized and open future. For the latest Reddit blackout info, see: https://reddark.untone.uk/
founded 1 year ago
🤷
It is just a decision that every instance owner can make for themselves (if they are aware of it).
It will be a huge headache for search engines anyway: all posts are basically replicated across all instances and look local to a search engine. So a single post will have hundreds of copies in the search engine's database, and it will probably output all of them as results (for now).
Is it possible/reasonable to have some sort of fediverse-encompassing API for search engines that would help them index only the original threads? A separate instance maybe? Or is it going to stay as is?
The search engines are going to have to deal with that. However, an instance can provide context in the form of a canonical URL, which tells a search engine where the content originated.
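As a rough illustration of the canonical URL idea: an instance serving a federated copy could emit a `<link rel="canonical">` tag pointing back at the origin. The sketch below is only that, a sketch. The function names and the `origin.example` / `mirror.example` hosts are made up for the example, and it is not how kbin or any other instance software actually does it.

```python
from html import escape
from urllib.parse import urlsplit


def canonical_link_tag(origin_url: str) -> str:
    """Return a <link rel="canonical"> tag pointing at the origin copy."""
    parts = urlsplit(origin_url)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"not an absolute http(s) URL: {origin_url!r}")
    # Escape the URL so characters like & and " are safe inside the attribute.
    return f'<link rel="canonical" href="{escape(origin_url, quote=True)}">'


def is_federated_copy(page_url: str, origin_url: str) -> bool:
    """True when the page is served from a different host than the origin."""
    return urlsplit(page_url).netloc.lower() != urlsplit(origin_url).netloc.lower()


# Hypothetical URLs, for illustration only.
origin = "https://origin.example/m/tech/t/1"
page = "https://mirror.example/m/tech@origin.example/t/1"

if is_federated_copy(page, origin):
    print(canonical_link_tag(origin))
```

A search engine that honours the tag can then fold the hundreds of copies into the one origin URL instead of listing each of them.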
@fearout It just occurred to me that all you need is your own server, and then you only have to index that one server. It basically gets data from all other instances through the standard ActivityPub protocol. It works differently from traditional crawlers, but the outcome is the same.
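To make the "index your own server" idea a bit more concrete, here is a minimal sketch of a tiny inverted index fed by incoming activities. It assumes each activity arrives as an already-parsed ActivityStreams `Create` whose `object` carries an `id` and HTML `content`. The `LocalIndex` class and the sample URL are hypothetical, and a real server would also need signature checks, updates, deletes and so on.

```python
import re
from collections import defaultdict
from html import unescape


class LocalIndex:
    """Toy inverted index over posts received via federation."""

    def __init__(self):
        self._postings = defaultdict(set)  # token -> set of object ids

    def ingest(self, activity: dict) -> bool:
        """Index the object of a Create activity; ignore everything else."""
        if activity.get("type") != "Create":
            return False
        obj = activity.get("object")
        if not isinstance(obj, dict) or "id" not in obj:
            return False
        # Crude tag stripping; good enough for a sketch, not for production.
        text = re.sub(r"<[^>]+>", " ", obj.get("content") or "")
        for token in re.findall(r"\w+", unescape(text).lower()):
            self._postings[token].add(obj["id"])
        return True

    def search(self, query: str) -> list[str]:
        """Return ids of objects containing every word in the query."""
        tokens = re.findall(r"\w+", query.lower())
        if not tokens:
            return []
        hits = set.intersection(*(self._postings.get(t, set()) for t in tokens))
        return sorted(hits)


index = LocalIndex()
index.ingest({
    "type": "Create",
    "object": {
        "type": "Note",
        "id": "https://origin.example/p/1",  # hypothetical origin URL
        "content": "<p>Final Fantasy XVI review</p>",
    },
})
print(index.search("final fantasy"))
```

Since the object `id` is the URL on the origin instance, search results from an index like this would point at the original post rather than at a local copy.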
@raphael I didn't know the instances would copy the messages. Interesting! I think search engines need to be redesigned to respect the robots rules of the origin instance then. If they are not designed for this, the content surely looks local to them. That's kind of a mess from a search engine's perspective.
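A crawler that wanted to respect the origin instance could, in principle, check the origin's `robots.txt` before indexing a federated copy. The following is a minimal sketch using Python's standard `urllib.robotparser`. It assumes the crawler already knows the origin URL of the post (for example from a canonical link), and the function names, the `ExampleBot` user agent and the `origin.example` host are all invented for the example.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser


def origin_robots_url(origin_post_url: str) -> str:
    """Build the robots.txt URL for the host that originally published the post."""
    parts = urlsplit(origin_post_url)
    return f"{parts.scheme}://{parts.netloc}/robots.txt"


def origin_allows(robots_txt: str, user_agent: str, origin_post_url: str) -> bool:
    """Check a post URL against the text of the origin's robots.txt."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, origin_post_url)


# Hypothetical robots.txt content; a real crawler would fetch it from
# origin_robots_url(...) and cache it.
robots = "User-agent: *\nDisallow: /m/private/\n"

print(origin_robots_url("https://origin.example/m/tech/t/1"))
print(origin_allows(robots, "ExampleBot", "https://origin.example/m/tech/t/1"))
print(origin_allows(robots, "ExampleBot", "https://origin.example/m/private/t/1"))
```

The point is only that the decision is made against the origin's rules, not the rules of whichever instance happened to serve the copy.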
Strangely enough, if I search for "final fantasy site:kbin.social" with my SearXNG-based search engine, it finds a few links. They are only matched on tags or people, not the actual content. So maybe use tags if you want to get indexed anyway.