Γαμινγκ the Algorithms: Large Language Models as Mirrors

Haris Shekeris

5 min read · Apr 1, 2023

Comments 3

Daniel_Eth

Algorithms are not clever in the sense that they don’t make judgments, opinions or choices of their own, they simply absorb input, combine what many others have said, treat those opinions according to set rules and spit out some output based on the input and rules (the functions that reinforce their synapses).

So right off the bat, I'm charitably going to assume that by "algorithms" you mean "LLMs", but even given that assumption, the statement strikes me as either trivially true or false. Yes, you could say that they simply take inputs and apply their rules (rules, I might add, that we don't know) to create output, but the same could be said for humans – we behave simply according to our environmental inputs and the rules governing how our brains work (rules which differ somewhat from person to person and in the same person across time). Yes, LLMs only operate if you prompt them, but prompting them is easy enough to do, and further, this can be automated. Likewise, I'm not really sure what meaningful definition would imply that they don't make judgments, opinions, or choices of their own.

Haris Shekeris

Dear Daniel,

First of all, many, many thanks for your time, charity and quickness! I really appreciate that you deemed my post worthy of a reply!

Now, as for the reply and the specific points that you raise. First of all, I think I am quite clear and explicit regarding the use of the shorthand LLM and algorithms. Indeed, in the epilogue, I end with the example of the YouTube algorithm, which I believe is an algorithm but not an LLM (please correct me if I'm wrong).

Now, on to your second point. I am puzzled by your assertion in brackets, '(rules, I might add, that we don't know)'. Are you saying that not even the coders who code LLMs know these rules (in this case I'd use the word algorithms, as the rules would, in my poor grasp of the matter, be in the form of algorithms, such as 'if you get prompt X, look into dataset Y', etc.), or do you mean that the rules are not known to the user? I would appreciate it if you could clarify this for me.

Finally, could you please explain to me what specific 'meaningful definition' you're after in your last sentence? I feel a bit lost.

Once again, many thanks for your prompt response. I would love it if my comments elicited another response from you that would allow both of us to reach a synthesis :)

Best Wishes,
Haris

Haris Shekeris

Whoops, scrap my previous answer, especially the first point. I now see that you were referring to a specific quote. Let me see.

Ah, yes, you may be right that I equivocated in the quote you cite, and that it would have been more precise had I used the shorthand LLMs. So thanks for your charity!

However, I would like to point out that the fact that you find something either trivially true or false under a binary logic may leave the proposition itself not trivial at all under a different interpretation, no? I mean, it's significant that it is not simply trivially true: it already has two interpretations. But OK, that's an aside that I'm not much interested in, and I think you may not be interested in it either.

And now your request for a meaningful definition suddenly makes a lot of sense too! I think what I was trying to express is revealed by 'of their own'. I mean that whereas humans (and maybe animals, though I'm not 100% sure; as I state in my caps bold letters, I may be guilty of anthropomorphism) may sometimes do as others do, and at other times do as they please (judge, choose, etc.), LLMs only have the first of these options. At the time of writing I may have thought that LLMs don't judge, opine, etc. without prompts. To this you can of course reply that humans never do so without prompts either, to which I'd reply that (a) this isn't so, since humans do sometimes opine unprompted, and (b) I'd rather anthropomorphise, in the sense of treating animals as imbued with human traits, than treat humans as glorified machines. This is a matter of arbitrary (you may say) choice on my part, and I will not offer an argument for it, at least not now; hence the caps bold.

Once again, many thanks for enlightening me, and apologies that my first reply misunderstood your comment. I hope I am now more on the ball!

Best Wishes,
Looking forward to an answer from you!
Haris

I wish to thank and express my deep gratitude to an esteemed friend (let’s keep him anonymous for the moment for GDPR reasons, hehe) who has pointed out to me (in other words, of course) that what I am describing above is called the ‘inner vs the outer alignment problem’ in the AI community.

It is worth remarking that the simple reflecting mirror is at least a 2,500-year-old (or perhaps much older) technology, whilst the distorting ones are perhaps less than 300 years old. So it seems that truthful reflection has been a more tried, tested and maintained technology than the improving or simply distorting kind (irrespective of whether the distortion improves or worsens the image). Either way, it is furthermore worth remarking that, at least for the mirror example, what may have begun as a significant and significantly transforming technology – allowing people to see their faces – has perhaps survived so long because its functions are purely aesthetic, rather than epistemic or truth-reflecting :)