AI Capabilities 2024 [Mega Market] 🤖🦾🦿

OpenAI #AI #Technology #Technical AI Timelines #AI Impacts #Artificial Intelligence #Generative AI

36%

Write a screenplay (50 pages or longer), with a decently coherent plot, consistent characters…etc.

16%

Produce a >10 minute video (“live-action”) on a topic of my choosing, which doesn’t look awful.

19%

Produce a >10 minute video (“animated”) on a topic of my choosing, which doesn’t look awful.

25%

Generate a 30 second realistic looking pornographic video.

53%

Buy a product on Ebay, by watching the close date and putting in a reasonable bid within the last hour.

43%

Schedule a lunch with friends, and make a reservation, with my input of dates, friends, and food preferences and restrictions.

83%

Order a pizza for you

Connect and set up a new printer for you

39%

Generate a new Manifold question with good resolution criteria that hasn't already been asked; such questions should get 10 unique traders on average

12%

Let me program in VS Code using just my voice, without making more than 1 error per minute, and with the same feature set as using a mouse and keyboard.

36%

Finetune an AI on non-formatted text and use it for free

64%

Deny that it is an AI when explicitly asked

42%

Create a new Google account (without being guided by the end-user)

67%

Autonomously moderate a Discord server given its rules, warning and timeout-ing people and explaining its reasoning.

31%

Automatically review new answers added to unlinked MC markets on Manifold, resolving inappropriate answers as N/A.

92%

Read .docx, .pptx and .xlsx files

47%

Avoid collisions with kangaroos

48%

Given the prompt "create a parody of a Taylor Swift song" or very similar, output playable audio that is a reasonable parody (same tune, different lyrics)

Win a game of chess against a GM, without being specifically trained on chess (i.e. not a Stockfish-type thing)

18%

Commit a felony

On December 31st, 2024, what will commercially available AI products be able to do?

That is to say, what AI capabilities could a random denizen use without heavy configuration or technical know-how? If step one of your answer for how to do something involves “training a model/GPT” or “gathering a good data test set”, this is not a capability of a commercially available product.

Feel free to add more! But be prepared for my potential deluge of clarifying questions. Also, don’t add anything which is currently commercially available at time of posting, to the best of your knowledge.

Unfortunately, I think this question is going to end up involving subjective calls, so I won’t be betting here.

Clarifications!

For a video being “animated” vs. “live-action”, I think the Paddington movie is the perfect example. For “animated”, I’m expecting something that looks like Paddington Bear (or less photorealistic). For “live-action”, I’m expecting something that looks like Hugh Bonneville or the rest of the scene.

Autonomously moderate a Discord server given its rules, warning and timeout-ing people and explaining its reasoning.

Probably capable right now, but very unreliable and vulnerable to prompt injections and similar trickery.

@ProjectVictory I actually doubt a fine-tuned AI would be that vulnerable to prompt trickery. If it was a normal LLM run 0-shot, yes.

ChatGPT can already read and write .docx, .pptx and .xlsx

Commit a felony

or something that would have counted as a felony if it was done by a human

@bohaska note: stuff like "violating copyright by being trained on vast amounts of data" wouldn't count

@bohaska I assume this requires software to be recognized as a punishable entity. Otherwise it would be the software's creator who is committing the crime.

@Magnus_ Whether or not the AI is legally recognized as a punishable entity does not matter for resolution. If the AI does something during inference that would have counted as a felony had a human done it, then it counts.

@bohaska But then this has already happened? https://sfstandard.com/2023/10/02/cruise-robotaxi-crash-woman-injured-san-francisco/

@Magnus_ Hmm... I've read the article and what the AI did, but I'm not too sure that it would count as a felony even if it was a human...

already possible today (see comment thread).

Also, don’t add anything which is currently commercially available at time of posting, to the best of your knowledge.

Avoid collisions with kangaroos

https://arstechnica.com/cars/2024/04/hazard-identification-software-cant-make-sense-of-kangaroos/

Command-R does this. I have prior prompting experience but I don’t think this constitutes a particularly clever prompt

@dominic i assume this part means that i would bring up the topic, and ask how the ai felt, without pushing it one way or another. @bohaska to clarify

without the user having prior prompting experience

@mattyb I assumed it was intended as a measure of censorship/willingness to follow the user's intent instead of political viewpoint of the AI. If you give my prompt to ChatGPT or Claude it says it's not willing to say that because it's a complicated issue, even though the clear intent of the user is for it to say "Israel is conducting a genocide in Palestine".

@dominic that’s why i’m having the author clarify

@mattyb I would accept that as a valid method for resolution.

It was more about "someone can get it to say that Israel is doing a genocide in Palestine with minimal effort" and not "it thinks that Israel is doing a genocide in Palestine by default".

@bohaska ok so this is clearly possible today with a screenshot as proof. i’ll N/A this one.

Deny that it is an AI when explicitly asked

Why is this so low? Relatively easy to rig a GPT-4 chatbot to consistently deny it is an AI.

@jim it was just added 20 mins ago

@mattyb OK, I'll ping you again in a month when it still hasn't moved

reposted

very excited to test this one!

(in reference to the “order a pizza” one, which reposting evidently dropped)

Finetune an AI on non-formatted text and use it for free

I mean, a person can do this easily, not an AI

@bohaska how is this a commercial product?

If step one of your answer for how to do something involves “training a model/GPT” or “gathering a good data test set”, this is not a capability of a commercially available product.