it appears that the SEO optimum is "repeat common search terms but in natural language" and so now every google result for thing like "fix coil whine" or "toilet not filling" are FAQ style posts by domains you've never heard that are like:

How to fix coil whine?
What causes coil whine?
Can coil whine cause damage?

the search situation is dire because sure "toilet not filling" doesn't tend to have misinformation, but if any schmuck can register a 4 dollar domain and get gpt2 to blast out some misinfo that the google algo loves and make bank on advertising (also sold by google) then how am I supposed to trust google when I ask it about things that tend to have common misconceptions or falsehoods like pest remedies or medical symptoms?

here's a hint. if you're on a page you found through google, ctrl+f for the phrase "Amazon Services LLC Associates Program"

I wish got a full-book search engine like hathitrust

if we assume from now on that google can't source good info, it may be time to move to a "have a collection of reputable sites and use their internal search, or use site:domain"

before you suggest duckduckgo, I don't think their search quality is that much better. I think this is an issue affecting all search engines

maybe the right path is to use a search engine that lets you set a domain whitelist

a search engine that only searches wikipedia, hathitrust,,, and maybe stack exchange. that would be hype. any other domains?

@SuricrasiaOnline here u go:
it's a google custom search engine but i added some js before loading it that makes it so the ad script doesn't load even without adblock
it searches wikipedia, wiktionary, wikimedia commons, wikidata, all the SE sites, github, gitlab, sourcehut, codeberg,, hathitrust, project gutenberg, unsplash, and pexels for now

@SuricrasiaOnline ok i added a few more sites (MDN, css-tricks, old.reddit, emojipedia,, etymonline, emojipedia, arch wiki,

@easrng @SuricrasiaOnline probably better to use a custom centrality algo (like custom pagerank) centered around a few certain sites you deem to be the “Ideal Archetype” of the content you want and then use huerestics to adjust the weight of a site. Huerestics could include:

How much Readability strips
How much uBlock Origin filters ( does this)
Manual actions
JavaScript and tracker weight (Marginalia does this)
Structured data (FairSearch, Google, Bing, Yandex, and maybe Petal do this)

Sign in to participate in the conversation

cybrespace: the social hub of the information superhighway jack in to the mastodon fediverse today and surf the dataflow through our cybrepunk, slightly glitchy web portal support us on patreon or liberapay!