Sort forum posts by: Occlumency (Old & Upvoted)

Emrik

5 min read · May 15, 2022

105

Comments 31

Charles He

A quick guess is that a good way to implement this (once a definition for “old” posts is given) is to track instances of people upvoting an old post (or just karma accumulation of old posts).

Then some score based on this (which itself can decay) can be blended into the regular “hotness” mix, so that people can see some oldie goldies.

This might be better than “naively” scaling up posts by compensating for traffic, because:

In this idea, older posts will tend to be promoted by relevance (e.g. some day EA may solve “really hard to find an EA job”, and that post would then not need to be as visible as one about an ongoing issue).
It also gives a new channel for low-karma old posts (e.g. a brilliant treatise ahead of its time) to reappear.

This also gives an incentive to actually upvote old posts, further helping to solve the problem.
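
A minimal sketch of the blending idea above, assuming each recent upvote on an old post is recorded with its age in days. The function names, the 30-day half-life, and the blend weight are all hypothetical, not anything the forum is known to use.

```python
def old_post_boost(upvote_ages_days, half_life_days=30.0):
    """Sum the recent upvotes on an old post, halving each one's
    contribution every `half_life_days` since it was cast."""
    return sum(0.5 ** (age / half_life_days) for age in upvote_ages_days)


def blended_score(hotness, upvote_ages_days, weight=0.2, half_life_days=30.0):
    """Blend the decayed old-post boost into an existing hotness score.

    `hotness` is whatever the regular ranking already produces; `weight`
    (an assumed tuning knob) controls how much the boost matters."""
    return hotness + weight * old_post_boost(upvote_ages_days, half_life_days)
```

Because the boost decays, an old post only stays visible while people keep upvoting it, which is the "promoted by relevance" property described above.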

Emrik

Oh, this is wonderfwl. But to be clear, Occlumency wouldn't be the front page. It would be one of several ways to sort posts when you go to /all posts. Oldie goldies is a great idea for the frontpage, though!

Charles He

Hmm, maybe we are talking about different things, but I think the /all page already breaks down posts by year.

So that seems to mitigate a lot of the problem I think you are writing about (less if within year inflation is high)?

I also think your post is really thoughtful, deep and helpful.

Emrik

Oh. It does mitigate most of the problem as far as I can tell. Good point Oo

Charles He

Your idea is still viable and useful!

There’s also valuable discussion and ideas that came up. IMO you deserve at least 80% of the credit for these, as they arose from your post.

Nathan Young

Also, the main thing we need is synthesis. How do we take many old articles and turn them into one good summary, given all we now know?

I would also like some old articles to be rereleased on the front page; maybe they would get votes from a new generation.

Filip Sondej

I like that idea about information cascades. We could test how big this effect is on the EA Forum by having a bot that randomly upvotes or downvotes new posts, and then measuring the final karma after some time.

There was a similar experiment with reddit (maybe you already know this).

The accumulating herding effect increased the comment’s mean rating by 25% compared to the control group comments (Figure 1C). Positively manipulated comments did receive higher ratings at all parts of the distribution, which means that they were also more likely to collect extremely high scores.

effect was present in the “politics,” “culture and society,” and “business” subreddits, but was not applicable for “economics,” “IT,” “fun,” and “general news”

Why do you think information cascades aren't significant on EA Forum? (I hope that's true)
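
The bot experiment suggested above could be analysed roughly like this. It is only a sketch: the three arms, the function names, and the assumption that `final_karma` already excludes the bot's own vote are mine, not part of the linked study.

```python
import random
from statistics import mean


def assign_treatments(post_ids, seed=0):
    """Randomly assign each new post to an arm: the bot upvotes it,
    downvotes it, or leaves it alone (control)."""
    rng = random.Random(seed)
    return {pid: rng.choice(["up", "down", "control"]) for pid in post_ids}


def herding_effect(final_karma, treatment):
    """Return each treated arm's mean final karma minus the control mean.

    `final_karma` maps post id -> karma after the waiting period, assumed
    to exclude the bot's own vote. Every arm needs at least one post."""
    by_arm = {"up": [], "down": [], "control": []}
    for pid, arm in treatment.items():
        by_arm[arm].append(final_karma[pid])
    control_mean = mean(by_arm["control"])
    return {
        arm: mean(values) - control_mean
        for arm, values in by_arm.items()
        if arm != "control"
    }
```

A clearly positive "up" difference or a clearly negative "down" difference would suggest herding; a real analysis would also want confidence intervals, which this sketch omits.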

Emrik

I just meant that I think info cascades aren't too important of a problem in EA more generally, not just the forum. I did not mean that I think they didn't have large effects. Here are some reasons to expect info cascades to have limited damage in EA. That said, I no longer fully endorse my earlier statements, and I'd encourage people to be more worried.

The linked study is really cool! I hadn't seen it, so thanks a lot for bringing it to my attention. I would've very much liked for them to ask the question "for the top 10% rated comments, what proportion of them were up-treated vs control group?" That would tell us whether the initial effect washes out for the top posts, or whether the amplification compounds with no upper bound.

The type of experiment you suggest and link to seems much more feasible for the EA forum than anything I've thought of, so if you're not going to suggest it in the EA forum suggestion thread (please do, and I'll strong upvote it), I will.

Filip Sondej

Great!

I posted it in that thread: link

Feel free to add something there.

Arepo

Another concern is karma inflation from strong upvotes. As time goes by, the strength of new strong upvotes increases (details here), which means more recent posts will naturally tend to be higher rated even given a consistent number of users.

Michael St Jules 🔸

Maybe we should automatically update upvotes to track people's current karma?

Arepo

Pretty sure that would be computationally intractable. Every time someone was upvoted beyond a threshold you'd need to check the data of every comment and post on the forum.

Charles He

Someone I know has worked with databases of varying sizes, sometimes in a low-level, mechanical sense. From my understanding, updating all of a person’s votes is a pretty simple database operation: scan the voting table for that ID, then do a little arithmetic for each upvote and touch another table or two.

You would only need to do the above operation for each “newly promoted” user, which is like maybe a few dozen users a day at the worst.

Many personal projects involve heavier operations. I’m not sure, but a Google search might be 100x more complicated.
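
To make the "scan the voting table for that ID" step concrete, here is a sketch using an in-memory SQLite database. The two-table schema, the column names, and `reweight_votes` are assumptions for illustration; the forum's real schema is not described in this thread.

```python
import sqlite3


def make_demo_db():
    """Build a tiny in-memory database with a hypothetical schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE posts (id INTEGER PRIMARY KEY, karma INTEGER NOT NULL);
        CREATE TABLE votes (
            id INTEGER PRIMARY KEY,
            voter_id INTEGER NOT NULL,
            post_id INTEGER NOT NULL REFERENCES posts(id),
            strength INTEGER NOT NULL
        );
        -- The index is what keeps the per-voter scan cheap.
        CREATE INDEX votes_by_voter ON votes (voter_id);
        INSERT INTO posts VALUES (10, 2), (11, 3);
        INSERT INTO votes VALUES (1, 1, 10, 2), (2, 1, 11, 2), (3, 2, 11, 1);
    """)
    return conn


def reweight_votes(conn, voter_id, new_strength):
    """Set all of one voter's past votes to `new_strength` and credit the
    difference to each voted-on post. Returns the number of votes changed."""
    rows = conn.execute(
        "SELECT id, post_id, strength FROM votes WHERE voter_id = ?",
        (voter_id,),
    ).fetchall()
    changed = 0
    for vote_id, post_id, old_strength in rows:
        delta = new_strength - old_strength
        if delta == 0:
            continue
        conn.execute(
            "UPDATE votes SET strength = ? WHERE id = ?", (new_strength, vote_id)
        )
        conn.execute(
            "UPDATE posts SET karma = karma + ? WHERE id = ?", (delta, post_id)
        )
        changed += 1
    conn.commit()
    return changed
```

The work per promoted user is proportional to the number of votes that user has cast, not to the size of the whole forum. This sketch does not update the authors' karma totals or handle follow-on promotions.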

Arepo

But in the process you might also promote other users - so you'd have to check for each recipient of strong upvotes if that was so, and then repeat the process for each promoted user, and so on.

Charles He

That’s a really good point. There are many consequent issues beyond the initial update, including the iterative issue of multiple induced “rounds of updating” mentioned in your comment.

After some thought, I think I am confident the issue you mentioned is small.

First, note that there is an end point to this process, i.e. a “fixed point” at which the rounds stop.
Second, based on some guesses, the second and subsequent rounds of promotions affect far fewer people (as opposed to a process that explodes). This is because the karma and vote power schedule has huge karma intervals between ranks, compared to the per-account karma increase from this process. Also, these intervals greatly increase as rank increases (something something concavity).

To be confident, I'd guess that the computations in the second and later rounds are probably <<50% of the first round's computational cost.

Finally, if the above wasn’t true and the increased costs were ridiculous (1000x or something) you could just batch this, say, every day, and defer updates in advanced rounds to later batches. This isn’t the same result, as you permanently have this sort of queue, but I guess it’s a 90% good solution.

I’m confident but at the same time LARPing here and would be happy if an actual CS person corrected me.
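
Here is a rough sketch of the "rounds of updating" as a work queue that runs until it reaches the fixed point. The thresholds are made up (not the forum's real schedule), and the model is simplified: a user's karma is just the sum of the strengths of votes they received.

```python
from bisect import bisect_right
from collections import deque

# Hypothetical karma cut-offs; vote strength = number of cut-offs reached.
THRESHOLDS = [0, 10, 100, 1000]


def strength(karma):
    """Vote strength implied by a karma total under the assumed schedule."""
    return bisect_right(THRESHOLDS, karma)


def settle(karma, votes_cast, applied):
    """Propagate promotions until nothing changes (the fixed point).

    karma:      dict user -> current karma (mutated in place)
    votes_cast: dict voter -> list of users they upvoted
    applied:    dict user -> strength their past votes currently count at
    Returns the number of individual vote updates performed."""
    queue = deque(u for u in karma if strength(karma[u]) > applied[u])
    updates = 0
    while queue:
        user = queue.popleft()
        delta = strength(karma[user]) - applied[user]
        if delta <= 0:
            continue  # already handled by an earlier pass
        applied[user] += delta
        for recipient in votes_cast.get(user, []):
            karma[recipient] += delta
            updates += 1
            if strength(karma[recipient]) > applied[recipient]:
                queue.append(recipient)  # this starts a follow-on round
    return updates
```

Karma only goes up and strength is capped by the number of thresholds, so the loop always terminates. To batch the work, one could process only part of the queue per day and leave the rest for later batches.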

Linch

[I'm not a computer scientist, but] Charles is right. The backend engineering for this won't be trivial, but it isn't hard either.

The algorithmic difficulty seems roughly on the level of what I'd expect a typical software engineering interview in a Silicon Valley tech company to look like, maybe a little easier, though of course the practical implementation might be much more difficult if you want to account for all of the edge cases in the actual code and database.

The computational cost is likely trivial in comparison. It's mathematically equivalent to every newly promoted user just unupvoting and reupvoting again. On net you're looking at an at-most 2x increase in the load on the upvote system, which I expect to be a tiny increase in total computational costs, assuming that the codebase has an okay design.

Arepo

To be clear, I'm looking at the computational costs, not algorithmic complexity which I agree isn't huge.

Where are you getting 2x from for computations? If User A has cast strong upvotes to up to N different people, each of whom has cast strong upvotes to up to N different people, and so on up to depth D, then naively a promotion for A seems to have O(N^D) operations, as opposed to O(1) for the current algorithm. (though maybe D is a function of N?)

In practice as Charles says big O is probably giving a very pessimistic view here since there's a large gap between most ranks, so maybe it's workable - though if a lot of the forum's users are new (eg if forum use grows exponentially for a while, or if users cycle over time) then you could have a large proportion of users in the first three ranks, ie being relatively likely to be promoted by a given karma increase.

Linch

I retract the <2x claim. I think it's still basically correct, but I can't prove it so there may well be edge cases I'm missing.

My new claim is <=16x.

We currently have a total of U upvotes. The maximal karma threshold is 16 karma per strong upvote at 500k karma (and there is no fractional karma). So the "worst case" scenario is if all current users are at the lowest threshold (<10 karma) and you top out at making all users >500k karma, with 16 loops across all upvotes. This involves ~16U updates, which is bounded at 16x.

If you do all the changes at once you might crash a server, but presumably it's not very hard to queue and amortize.
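
Spelled out, the counting argument above looks like this. It assumes each vote's weight is an integer between 1 and 16 that never decreases, and that every update raises some vote's weight by at least 1.

```latex
% Assumption: vote weights are integers w_i in {1, ..., 16} that never decrease.
\[
\#\text{updates}
\;\le\; \sum_{i=1}^{U} \bigl( w_i^{\text{final}} - w_i^{\text{initial}} \bigr)
\;\le\; \sum_{i=1}^{U} (16 - 1)
\;=\; 15U \;<\; 16U .
\]
```

This bounds the number of vote-weight updates only. It says nothing about the cost of finding which votes to update.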

Arepo

I think you're mixing up updates and operations. If I understand you right, you're saying each user on the forum can get promoted at most 16 times, so at most each strong upvote gets incremented 16 times.

But you have to count the operations of the algorithm that does that. My naive effort is something like this: Each time a user's rank updates (1 operation), you have to find and update all the posts and users that received their strong upvotes (~N operations where N is either their number of strong upvotes, or their number of votes depending on how the data is stored). For each of those posts' users, you now need to potentially do the same again (N^2 operations in the worst case) and so on.

(Using big O approach of worst case after ignoring constants)

The exponent probably couldn't get that high - eg maybe you could prove no cascade would cause a user to be promoted more than once in practice (eg each karma threshold is >2x the previous, so if a user was one karma short of it, and all their karma was in strong upvotes, then at most their karma could double unless someone else was somehow multiply promoted), so I was probably wrong that it's computationally intractable. I do think it could plausibly impose a substantial computational burden on a tiny operation like CEA though, so someone would need to do the calculations carefully before trying to implement it.

There's also the philosophical question of whether it's a good idea - if we think increasing karma is a proxy for revealing good judgement, then we might want to retroactively reward users for upvotes from higher ranked people. If we think it's more like a proxy for developing good judgement, then maybe the promotee's earlier upvotes shouldn't carry any increased weight, or at least not as much.

aog

I agree, upvotes do seem a bit inflated. It creates an imbalance between new and old users that continually grows as existing users rack up more upvotes over time. This can be good for preserving culture and norms, but as time goes on, the difference between new and old users only grows. Some recalibration could help make the site more welcoming to new users.

In general, I think it would be nice if each upvote counted for roughly 1 karma. Will MacAskill’s most recent post received over 500 karma from only 250 voters, which might exaggerate the reach of the post to someone who doesn’t understand the karma system. On a smaller scale, I would expect a comment with 10 karma from 3 votes to be less useful than a comment with 10 karma from 5-8 votes. These are just my personal intuitions; I would be curious how other people perceive it.

Emrik

The users with the highest karma come from a range of different years, and the two highest joined in 2017 and 2019. I don't think it's too much of a problem.

Emrik

Good point! Edited the post to mention this.

Emrik

Looks like they updated to add something similar to this. ^^

Top (Inflation Adjusted): Posts with the highest karma relative to those posted around the same time.
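
One way "karma relative to those posted around the same time" could be computed is sketched below. The forum's actual formula isn't given in this thread, so the monthly buckets and the divide-by-monthly-mean rule are assumptions.

```python
from collections import defaultdict
from statistics import mean


def inflation_adjusted(posts):
    """Rank posts by karma divided by the mean karma of posts from the
    same month (an assumed baseline for 'posted around the same time').

    posts: list of (title, 'YYYY-MM', karma) tuples.
    Returns titles, highest adjusted score first."""
    by_month = defaultdict(list)
    for _, month, karma in posts:
        by_month[month].append(karma)
    baseline = {month: mean(values) for month, values in by_month.items()}
    scored = [
        (karma / baseline[month], title)
        for title, month, karma in posts
        if baseline[month] > 0  # skip months with no positive baseline
    ]
    return [title for _, title in sorted(scored, reverse=True)]
```

With this rule, a 40-karma post from a month averaging 25 karma outranks a 120-karma post from a month averaging 100.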

JP Addison🔸

I thought I had already written this, but FYI this post was counterfactually responsible for the feature being implemented.

(The idea had occurred to me already, but the timing was good to suggest this soon before a slew of work trials.)

Greg_Colbourn ⏸️

This is great! But what is going on here?

Michael Townsend🔸

I'd use this feature if added!

I wonder if the algorithm (if it is done algorithmically?) that selects the posts to put in "Recommendations"/"Forum Favourites" should also be weighted for occlumency. It seems like the reasons outlined in this post would push in favour of this, though I have some concern that there'd be old posts that are now outdated, rather than foundational, which could get undue attention.

Matthew Yglesias

I share this "outdated, rather than foundational" concern. I think it is possible that what is really called for here is human editorial attention rather than algorithms and sorting tools. Someone or someones to read through tons of old stuff and make some Best Of collections.

Emrik

Experimental fine-tuning might be in order. But even without it, Occlumency has a different set of problems to Magic (New & Upvoted), so the option value is probably good.

As for outdated posts, there could be an "outdated" tag that anyone can add to posts and vote down or up. And anyone who uses it should be encouraged to link to the reason the post is outdated in the comments. Do you have any posts in mind?

Ruby

For LessWrong, we've thought about some kind of "karma over views" metric for a while. We experimented a few years ago but it proved to be a hard UI design challenge to make it work well. Recently we've thought about having another crack at it.

Emrik

I have no idea how feasible it is. But I made this post because I personally would like to search for posts like that to patch the most important holes in my EA Forum knowledge. Thanks for all the forum work you've done, the result is already amazing! <3

Yitz

I would be in favor of this!
