Show HN: I Built Paul Graham's Intellectual Captcha Idea (mentwire.com)

by nowflux 45 comments 29 points
Read article View on HN

45 comments

[−] snissn 39d ago
This is weird political propaganda. The first post misrepresented annual costs of housing.
[−] dwroberts 39d ago
Yeah this was my immediate impression as well. The assertion that homing every homeless person is the same as putting them up in unique per-person accommodation that ends up costing 65 billion is absurd.
[−] littlestymaar 38d ago

> the same as putting them up in unique per-person accommodation

And a new one every year, at that.

[−] adamm255 39d ago
All the references about Data Center water use were from one guy too.
[−] littlestymaar 39d ago
The first one is utterly stupid.

Housing is a very complex issue that goes well beyond the sheer cost of the housing unit.

Do I think “solving homelessness” is easy with $10B? No. Does the calculation made in the answer makes any sense: absolutely not.

[−] Kwpolska 39d ago
Also, how do you solve homelessness? Giving every homeless person a subsidised house isn't going to work out if that person can't afford it (especially when the subsidies run out), because they're also unemployed, or because they have a drinking problem which means they trash the house or spend all their income on alcohol and can't afford rent. For some people, temporary free housing, rehab, and help with finding employment could be a cheap and effective way to get them back on track and able to afford a place with their own income after a few months.
[−] littlestymaar 39d ago

> Also, how do you solve homelessness?

That's the problem: there are many completely different situations that lead someone to become homeless, and the solutions must be tailored to the specific case you are targeting. (it's a variant of the Anna Karenina principle: “All happy families are alike; each unhappy family is unhappy in its own way.”)

And you also need to take into account the role of homelessness as a coercive force in modern societies: it the treat that keeps worker behave at work, prevent unhappy women from leaving their husband, and make sure tenants pay their rent to their landlord and make sure people repay their mortgage before any other expense. Remove it suddenly and then you'll probably end up needing way more housing than what you planned for previously homeless people. And it would absolutely tank the real estate market.

I'm convinced that all of the above would be a good thing for society, but the shock would be absolutely gigantic. It's not just about investing a few billions of dollar, the cost of the housing per se would be negligible in the earthquake that it would cause.

[−] southerntofu 39d ago
Came here to say this. It takes political will, but money is certainly not the issue. It's just like with healthcare, free healthcare for all costs much less per person than privatized healthcare like they have in the USA. It's just a matter of designing the system properly so the money doesn't get siphoned off by parasite investors.
[−] nailer 39d ago
[flagged]
[−] littlestymaar 38d ago

> > The first one is utterly stupid.

> OK, why?

Come on. I literally spent the rest of the comment on that. …

> > Housing is a very complex issue

> OK, why?

Because you can't increase it enough without also tanking the real estate market. And because homelessness has many factors you can't fix with a one size fits all solution.

> > Does the calculation made in the answer makes any sense: absolutely not.

> OK, why?

Because nobody is ever going to pay a brand new home to each homeless every year (which is what the calculation is about).

There's no point in participating in any kind of adult discussion if you can't read the other's comment. Also brandolini's law.

[−] nailer 38d ago
It's possible you and I might be discussing different things by 'the answer'.

The second item (the first is about election predictions) is:

> It would cost about $1 billion annually to end family homelessness. And about $10 billion to end all homelessness. But our politicians would rather spend that on genocide.

with the answer being:

> False. Since 2019, California has spent about $24 billion on homelessness, but in this five-year period, homelessness increased by about 30,000.

Which is fairly clear, I don't think would label 'bullshit' and it doesn't seem like you're trying to refute.

If you mean something else by 'the answer' I apologize.

[−] littlestymaar 38d ago

> with the answer being:

> > False. Since 2019, California has spent about $24 billion on homelessness, but in this five-year period, homelessness increased by about 30,000

Ah, I see the problem now, OP changed their previous answer to this question, which explains the lack of mutual understanding between the two of us.

The previous answer, which I called bullshit, used to multiply the number of homeless people in California by the price of a buying house and used that as an argument. I think we can agree that this is a very poor one as I've not see anyone suggesting buying real estate at market rate and giving it to homeless people for free. Also, it used to be the first question.

That being said, I don't think the question makes any sense anyway even with its updated answer: how you spend the money has as much impact as how much money you spend. The answer means we can agree that California's current policy to fight homelessness is arguably a failure, but that doesn't mean there couldn't be a cheaper policy leading to better results (like how European countries spend much less in healthcare than Americans for a significantly better health and higher life expectancy). But on the flip side, even if such a good policy existed, that wouldn't mean solving homelessness is trivial like the original tweet suggests.

[−] nailer 38d ago

> The answer means we can agree that California's current policy to fight homelessness is arguably a failure, but that doesn't mean there couldn't be a cheaper policy leading to better results

Agreed.

[−] zug_zug 39d ago
I agree. Also put some about global warming on there, or pollution, or efficacy of the covid vaccine.
[−] philipkglass 39d ago
At least one of the test questions was just a screen shot from a tweet. It was difficult to read. I'd suggest extracting text from screen shots with OCR. Apple has built-in functionality for this on their operating systems with Live Text. There are strong open source systems based on small vision language models for this, too. The one I have been recommending lately is GLM-OCR:

https://github.com/zai-org/GLM-OCR

It's fast and can run even on low-resource computers.

---

Does this CAPTCHA actually resist computers? I didn't try feeding the questions I got to an LLM, but my sense is that current frontier models could probably pass all of these too. Making generated text pass the pangram test is simple enough for someone actually writing a bot to spin up automated accounts.

[−] tripplyons 39d ago
I think it's more about resisting some humans than it is about resisting machines.
[−] _alternator_ 39d ago
Two mild concerns: first, I missed one and it told me I didn’t miss any at the end.

Second, some of the logic problems have flawed premises (eg All licensed pilots must pass a medical exam. Jake is a licensed pilot, therefore Jake passed a medical exam.) If you see the flaw in the premise (it assumes no fraud) then the conclusion does not follow.

Im not sure you’re going to be able to actually improve human discourse this way. The idea that it’s ‘irrationality’ that’s the source of xitters problems is far too shallow to really make a change.

[−] rogual 39d ago
I took the pilot one as an abstract logic type question where you're supposed to assume the premise is true, so I said yes and the page said I was right, because that's a "valid logical deduction" or something.

Then there was another question in the same format that said "if you study hard enough you'll pass the exam. You didn't pass, so you didn't study hard enough." So I thought, oh, another logic one, and said yes to that one too, but the page was like, "not quite! You might fail for other reasons!"

[−] Windchaser 39d ago

> If you see the flaw in the premise (it assumes no fraud) then the conclusion does not follow.

Right. Or he could've been grandfathered in.

But more basically: this is logically valid, but not logically sound. These are two different ways in which something may be "true" or "false", and in this format, it's not completely clear, soundness vs validity. Based on context clues like the absurd premise of pilots -> medical exam, I assumed validity, but it's still a weird format.

[−] wlkr 39d ago
Yes, I also assumed an imperfect system with cases of fraud for the medical exam question and was quite surprised by the overly simplistic response.
[−] airza 39d ago
I opened it, it told me it was impossible to build a house in california for less than 350K, i closed it
[−] next_xibalba 39d ago
Same. And I'm not even focused on whether this is a reasonable number or not. The quoted tweet also says "But our politicians would rather spend that on genocide." And I'm asked to evaluate whether this is "accurate" with a thumbs up or thumbs down. (According to Mentwire, it is not accurate). So I'm evaluating both the cost of housing the homeless, but also whether politicians would rather fund genocide. So, this seems like it is not really an intellectual CAPTCHA, but rather an ideological CAPTCHA.

And just to disclose my biases, I would tend to believe that $350k is an absurdly high figure and that politicians are obviously not holding a vote where they are forced to choose between ending homelessness and funding genocide. But I believe that people who disagree with me can be considered intelligent and not "too dumb to pass an intellectual CAPTCHA".

[−] Centigonal 39d ago
One issue with this is that it mixes hypothetical formal logic style problems (where there are clear, inflexible rules) with real life examples (where group membership/traits, cost estimation, and causal attribution are less clear) without always disambiguating which one is which. Fun quiz though!
[−] strangattractor 39d ago
Yes - I was given a syllogism that was logically correct but the Truthyness of the premises where wrong or misleading in the real world.

All pilots need a medical exam to have a license.

John is a pilot.

John has had a medical exam.

Pilots can be licensed without a medical exam. It is illegal for them to fly without a valid medical but the 2 are separate issues. Also LSA pilots do not need a medical.

[−] metalliqaz 39d ago
it's a really nice idea, but of course completely antithetical to the business model of modern social media platforms. So, it will never go anywhere. HN might be the only locale with any real numbers that I could see actually using it. Even BlueSky I think could never risk something like this.

as an interesting thought experiment, consider the questions that TruthSocial would put in. would an average unsophisticated user be able to tell the difference between your product and a hopelessly biased version such as that? they would support the correct answers with their own misinformation. Would it be just another schism of reality?

[−] blamestross 39d ago
Reminds me of IQ tests I took as a kid.

"Finish the sequence" with 4 options and "no pattern" as the choices.

It becomes "what does the moderately intelligent person who wrote the test thinks counts as a pattern" not the intended exercise at all. There was never enough samples to even guess at a real pattern in them.

[−] dogleash 39d ago

> "Make users pass a test on basic concepts like the distinction between necessary and sufficient conditions before they can tweet."

If twitter ever became what he says he wants, he'd quit using it within a month. He already has the option to close twitter and seek out experts' writing. Why is he choosing to bask in the emotions generated by people being wrong on twitter?

It's like listening to a friend complain about twitter being "full of" content that you rarely/never see on your feed. Nah, that's their algorithm and they just told you exactly who they are.

[−] zug_zug 39d ago
Well, fun little experiment, at the very least.

I'll also pick apart this question "The mainstream media gave Trump an 8% chance to win the 2016 election. Him winning clearly shows they were biased and wrong."

Well, defined biased. Pollsters weren't using perfectly random models, so there was systemic bias (as there always is when you can't have a truly representative sample). There was also a sense of group-think that "This couldn't possibly happen, this is beneath our country" from a lot of individuals in and out of media.

That said it doesn't prove a deliberate attempt to deceive, but it certainly was a wake-up call to many in the media.

Point here being -- almost all emotional discourse is taking very nuanced situations and trying to cram them into semi-arbitrary judgmental terms like "fair" and "legitimate" and "biased."

[−] tomasphan 39d ago
I answered 8/10 correctly but mostly on instinct, for example betting that the Trump tweet is misleading. Opus 4.6 got 9/10 correct. You might need an internal time limit (don't show the user) and some strawberry questions.
[−] tomjen3 39d ago
I love the idea of this. I want it to succeed.

But most of the questions I got... They weren't very good and not just because I got them wrong - I got a bunch of them right that I shouldn't have.

For example, the one about homelessness, where it ends with a guy saying our politicians would rather use the money for genocide.

I downloaded the statement for that reason, got told my vote was correct and then it came up with it was correct only because of the first part of it.

I think you're trying to import statements automatically and I fear that won't work. I also fear that you're gonna get, just crap, to be honest. And your social network doesn't deserve that.

I think your best bet is to look at the kind of questions asked on the LSAT, and just do a bunch of essentially IQ and general-knowledge questions. Take the input from Twitter as inspiration, use it as a template and it might work.

One thing you might consider is wanting to filter out people who can't see past their own political agenda.

You can do that by making enough questions so that you're sure to catch people, no matter what they believe on all the hot issues of the day. This probably isn't as hard as it sounds, there's only going to be seven or eight hot issues.

You pick three of them and you should be pretty certain that you will cover the entire spectrum. So for example, you could make sure to include, pro LGBT, pro abortion and pro guns. You would catch most people on that and then you should exclude them if they cannot see past their blindness.

I hope you make this work, the world needs it.

[−] anon115 39d ago
with fine iterations i think it will get their the idea is their i see it
[−] closetkantian 39d ago
Frankly, I think if you're going to do something like this you're going to need to use well constructed critical reasoning items from graduate level exams like the GRE and LSAT.
[−] kotaKat 39d ago
If I want a dumb jackass nonsensical captcha, I’ll just use hCaptcha.
[−] RugnirViking 39d ago
The questions suck. It's an interesting idea, but basically a worse execution of Cicada 3301.

I'd suggest using questions that are nothing to do with politics, and also actually falsifiable. Estimation questions could be a good proxy. How many grains of sand on the earth, you gotta be right to two orders of magnitude or the like

Logic questions aren't bad, but don't use ones like in your example because they're badly posed questions. (Of course, many people claim this about any kind of logic question etc)

But ultimately the questions don't matter. It's the existence of any barrier at all, to set a required standard of effort, and to set the tone. Do you want yours to be 'smart person'? Or 'agrees with me ideologically'? Right now it's leaning more ideological than perhaps intended

[−] throw_winblows 39d ago
Intellectual?
[−] gertop 39d ago

> Try sample questions here without signing up

It's very gracious of you to let us fill captchas without signing up first.

[−] voodooEntity 39d ago
Funny thing had to laugh :)
[−] littlestymaar 39d ago
You can rename it “conservative circle jerk captcha” and it would be more accurate.
[−] notsound 39d ago
I got a 10 out of 10 because I've seen these strawmans in center-right arguments before. Definitely promotes thinking inside the box; the homelessness question presupposes the most expensive solution (buying the homeless homes) in opposition to annual costs that would probably go down over time. I doubt both figures.
[−] aaron695 39d ago
[dead]