Hide table of contents

TL;DR: Any suggestions for me, as an EA at arxiv? Do you think this is a potentially impactful role? Why?

The main project: Rewrite arxiv’s 1991 tech (which has some big problems), and run it on GCP.

The role: Lead the big tech decisions and implementation, but not the product decisions, at least not officially. I’ll do this with a good friend, and later on more devs will join.

My fit

Would I enjoy this role? Yes, probably.

Would I be good at it compared to other roles I could do? Yes, probably.

Will I be better for arxiv than their alternative? Yes, I think significantly (they're having a hard time hiring).

Help with Ideas - especially if you think about global priorities

Would you like to have an EA work at arxiv? Why? What could I accomplish? Does this sound like a really high impact opportunity or "just another job"?

Please discuss, my own social circle doesn’t have many people who think about global priorities and I really hope the forum will help me out.

I plan to live-blog this project

For example, “here’s a problem I’m facing, solutions I’m considering, and tradeoffs I see”. Would you like to follow and help me do a better job?

Also, if I do this on Twitter, I hope to get followers from the scientific community around the world and then also tweet EA content. (what do you think?)

Subscribe to this comment to get notified if I start live blogging and on what platform.

Arxiv will probably hire more

Especially in NYC, especially people who’d aim to be the “in house” team and work there for many years.

Subscribe to this comment if you’d like to know when this happens.

Thanks

Feel free to ask things, I tried keeping this post short.

24

0
0

Reactions

0
0
Comments45


Sorted by Click to highlight new comments since:

Keeping an eye out for dual use biotechnology research, metascience stuff

Relatedly, an area where I think arXiv could have a huge impact (in both biosecurity and AI) would be setting standards for easy-to-implement manged access to algorithms and datasets.

This is something called for in Biosecurity in an Age of Open Science:

Given the misuse potential of research objects like code, datasets, and protocols, approaches for risk mitigation are needed. Across digital research objects, there appears to be a trend towards increased modularisation, i.e., sharing information in dedicated, purpose built repositories, in contrast to supplementary materials. This modularisation may allow differential access to research products according to the risk that they represent. Curated repositories with greater access control could be used that allow reuse and verification when full public disclosure of a research object is inadvisable. Such repositories are already critical for life sciences that deal with personally identifiable information.

This sort of idea also appears in New ideas for mitigating biotechnology misuse under responsible access to genetic sequences and in Dual use of artificial-intelligence-powered drug discovery as a proposal for managing risks from algorithmically designed toxins.

Do you mean "biotechnology that could lead to a pandemic and it's better if nobody publishes"?

+1 to this. arXiv could play a big role in contributing to a norm around not publishing dual use bio research. There are challenges of screening large numbers of papers, but they can be met. See here for an example from ASM. bioRxiv may or may not be screening already, but they aren't sharing information about their practices. It would be helpful if they were more vocal about the importance of not publishing dangerous information.

I agree with the goal, +1.

I think implementation is going to be very hard. TL;DR: arxiv can't just reject papers:

If arxiv simply rejects some paper, the author might, as a naive example, tweet about "even arxiv think this is such a big deal that they won't publish my paper! but science should be FREE!" and it might get even more traction.

I think a better way to handle this would be to reply to the authors and talk to them nicely about why we ask them to keep this secret, even though they worked really hard on it.

Any thoughts about this?

Anyway, it is worth trying.

Replying to authors seems good to me, but I recommend talking more to biosecurity experts at FHI / CSER / SERI for advice (like Daniel Green, Tessa or ASB) because I think information security is complicated and many actions can backfire

Thanks!

I don't know who ASB is, would you somehow connect us (for example, forward the post to them)?

 

(Daniel Greene and Tessa replied to this post)

I think it's vastly worse if people don't publish. It's much better if the entire world can contribute research on how to handle a threat, than if we just count on nobody else having the same (potentially harmful) idea while we do our secret research, and hitting an unprepared world.

So don't publish an explicit list of new pathogen genomes? Sure; don't publish anything describing them or the technology used to create them? Bad.

How about publishing the instructions for building a "3d printer" that prints pathogens, and that people can use at home?

This is complicated, figuring out what is more dangerous than not.

I think a very strong reason is needed for something not to be published. I don't think it's that complicated - better to err on the side of publishing something, than hinder the world's preparation for threats (and prevent the positive impact the same technologies can have).

Edit: linking to Tessa's comment for a more cautious and nuanced direction that I still agree with.

Would you defer you opinion to whatever the people working fulltime on preparing for these threats think?

Maybe if you ask a wide enough group of them? But for sure not "what a small group of them selected to agree with infohazard assumptions think".

See Daniel Greene's comment about creating better norms around publishing dangerous information (he beat me to it!).

This is really awesome! Along the things that Hauke mentioned around scientometrics, I'd love to figure out a native integration for predicting different kinds of metrics for new research papers. Then other scientists browsing on Arxiv can quickly submit their own thoughts on the quality and accuracy of different aspects of each paper, as a more quantitative and public way of delivering feedback to the authors.

A quick sketch: On every new paper submission, we automatically create a markets for:

  • "How many citations will this paper have?"
  • "Will this paper have successfully replicated in 1 year?" 
  • "Will this paper be retracted in the next 6 months?"
  • Along with letting the author set up markets on any key claims made within the paper, or the sources the paper depends on

Manifold would be happy to provide the technical expertise/integration for this; we've previously explored this space with the folks behind Research.bet, which I would highly encourage reaching out to as well.

Thank you!

This is indeed one of the "wow" features I was considering, but I didn't think it through as much as you obviously have (really nice!).

(and also ways it could cause harm by accident, consider talking to me before you launch it very widely on something like "all research"?)

Anyway my current opinion is that I'd be very happy to integrate with Manifold for doing something like this, though please also note I am not a decision maker.


If you don't mind, I'm going to add a screenshot from this cool website you linked, in favor of people who might not click through :)

Regarding the screenshot - could you explain what these graphs give us? Compared, for example, to "Is the paper scientifically sound?" or "Will the result replicate?"

Update: 

TL;DR: Bio "infohazard" filtering/vetting will be handled by Ben Snyder or someone he finds, probably without my help. I think this is (going to be) a big success that will reduce, at least, bio risk.

Details:

They (*arxiv) are already interested (and somewhat doing) this

So no need for me to push this agenda with them.

Why I think so:

  1. A founder from bioRxiv and medRxiv, Richard Sever, says about this filtering:
    1. "This is desirable and in fact already happens to an extent"
    2. "arXiv and bioRxiv/medRxiv already communicate regularly"

I can provide the references if you want

We think they (*arxiv) might be missing resources

Such as people to do the vetting - people who can go over submitted bio papers and decide if they're dual-use.

Ben Snyder will try to find

  1. People who can do this vetting
  2. Funding for these people

This might require a software system

For example, we might want dual-use papers to be accessible to a specific community but not freely available on the internet.

If this is so - we might want to keep them in a system that isn't too easily hackable.

I might personally be a good fit to write this system (specifically because I think I have a reasonable security mindset), but by default I'll be hands-off this project unless Ben contacts me

So the vetting will be done by humans? Is this sustainable in the long term? E.g. how quickly does the number of submissions grow?

[anonymous]5
0
0

I would say two things about this.

  1. This project is still in the early stages, and I have not yet developed a concrete hypothesis about what would constitute "good guidelines that a) reduce GCBR and b) preprint servers will approve of." arXiv and other servers already use humans to do some basic vetting, so expanding their mandate to cover dual use issues is an option, but there may be cheaper things (like researcher self-certification) to try first.
  2. Once I have a hypothesis that other EA biosecurity people agree is worth testing, the next step is getting in touch with preprint server administrators and users to see what they think. This should help answer the other questions you raise.

EDIT: I am no longer leading this project, and after talking to a few biosecurity professionals, the project is on hold.

I don't know, I assume it's done by humans.

My priors are: 

  1. Do something manually before automating it
  2. Talk to the users (medXiv, bioXiv) about their situation before picking a solution
  3. At some point this will be at least somewhat automated, reducing at least most of the human work

Great! Arxiv is very important, and its tech matters.

I'm curious why you would want to move to GCP, tying the project to the whims and future of Google. I think you should take steps to protect the project here. It should be cheap and easy for your successors to migrate away from Google if they need to.

Product-wise, I'd like it if arxiv would link to an eventual peer-reviewed publication of the same paper, as citing that is usually more appropriate if it exists.

Why GCP:

Right now arxiv is running on-prem on Cornell university servers.

I assume you agree that moving to the cloud makes sense.

There are some reasons to pick GCP specifically, and I also like it personally, so I'm not arguing with those reasons.

 

Easy to migrate away:

Yes, I agree. I plan for everything to be in Docker and to be as un-vendor-locked as reasonably possible.

 

Link to an eventual peer-reviewed publication of the same paper

Nice! Thanks

link to an eventual peer-reviewed publication of the same paper, as citing that is usually more appropriate if it exists.

+1. Just had that problem multiple times, doing the literature review for my thesis.

Congrats on the job! Seems really high impact. A few thoughts:

  1. Scientometrics

Scientometrics is the field of study which concerns itself with measuring and analysing scientific literature such as the impact of research papers and academic journals. “Of course, such tools cannot substitute for substantive knowledge of human experts, but they can be used as powerful decision support systems to structure humans’ effort and augment their capabilities/efficiency to handle the enormous volume of data on research input and output” Finding rising stars in bibliometric networks | SpringerLink  Scientometric indicators and machine learning-based models can be used to predict the ‘rising stars’ in academia , —identifying these junior researchers and awarding them prizes would greatly improve research output. https://ieeexplore.ieee.org/abstract/document/8843686/.

2. https://allenai.org/ has both semantic scholar and an NLP AI team - I think there are overlaps wrt using the arxiv corpus for language models.

3. The creator of https://www.arxiv-vanity.com/ is also really interested in EA- maybe get in tocuh

Thanks!

 

Congrats on the job!

Just saying this isn't closed yet, I'm negotiating the contract (which currently has some scary clauses).

 

 

These are good references - I'm especially interested in arxiv-vanity, are you talking about Ben Firshman? I'll reach out once I start working there

These are good references - I'm especially interested in arxiv-vanity, are you talking about Ben Firshman? I'll reach out once I start working there

Yes

Subscribe to this comment to hear if arxiv are hiring (I estimate this will be for NYC roles, surely for developers, and perhaps also a product/project manager)

Please don't reply to this comment yourself

Subscribe to this comment to hear if I start live blogging about rewriting arxiv.org

Please don't reply to this comment yourself

[not live blogging, but a major update that I assume people who subscribed to this commend would be interested in]

arxiv will probably get bio "infohazard" filtering regardless of whether I join. This seems to be the biggest upside suggested for working there.

Please don't reply to this comment.

More details are in another comment which you can also reply to.

Welcome to Cornell! 

:O

Thanks!

btw it's not trivial for me to handle the.. politics? org structure? around arxiv. If this sounds like something you could help with, would you message me?

I would love to see the arxiv expand to other disciplines that love preprints.  I think centralizing the scattered social science preprint sphere would be doing good for science!  (I am an ex-physicist turned political scientist, and I miss the arxiv so much.)

also, I would love if the arxiv had a good export to .bib file rather than just a copy-paste .bib formatted text, so I didn't have to click through to the ADS to generate a .bib file.  It would save me quite a few seconds.  ;)

Thanks!

other disciplines that love preprints

Are there disciplines that would like preprints but don't have an arxiv-like website?

 

.bib download:

  1. OMG it's going to be so easy to make users happy. Just a download button?
  2. My intuition is that formats can be improved way beyond that. For example, why do we still use PDFs and latex when we have HTML or jupyter notebooks? But I'm not an academic and probably missing lots of important details. Mainly my intuition is that this can be way-improved

Distill made incredible interactive scientific artifacts. But they recently went on ~indefinite hiatus, and mentioned a correspondingly incredible amount of work per post ("more than 50 hours of help with designing diagrams, improving writing style, and shaping scientific communication... burnout"). This is despite their having world-class support and funding.

I personally think Distill just had way-too-high standards for the communication quality of the papers they wanted to publish. They also specifically wanted work that "distills" important concepts, rather than the traditional novel/beat-SOTA ML paper.

I think I get the strategic point of this -- they wanted to create some prestige to become a prestigious venue, even though they were publishing work that traditionally "doesn't count". But it seems like it failed and they might have been better off with lower standards and/or allowing more traditional ML research.

You could still do a good ML paper with some executable code, animations, and interactive diagrams. Maybe you can get most of the way there by auto-processing a Jupyter notebook and then cleaning it up a little. It might have mediocre writing and ugly diagrams, but that's probably fine and in many cases could still be an improvement on a PDF.

Agree with this

Update: I texted an astrophysicist friend about including code in arxiv postings and got back "EXTREMELY GOOD"

Thanks!

(this is very easy to implement, at least software-wise; the interesting challenge would be making sure nobody can share malicious code)

Yes!  Political science often uses SSRN, but SSRN is... worse than the arxiv and doesn't really do a daily digest of relevant papers (the astro-ph mailing list is every astrophysicist's way of staying up to date with literature).  Preprints sometimes go on author's websites, sometimes get linked on Twitter, it's just not centralized. 

 

Econ has the same problem - there is an econ-gn category on the arxiv, but not a category for, say, crime, or health, or gender.  Some preprints are on NBER, some are on IZA, some are on SSRN, etc.

 

Oh my god, if you let people include code in their preprints, you will be every astrophysicist's favorite FOREVER.

:)

 

Thanks, the "add a category" thing sounds like a low hanging fruit, and it sounds like, if the arxiv-competitors are even worse than arxiv, that if I'd do this right, perhaps they'd want to merge in too. Sounds very promising and I didn't know it

Curated and popular this week
 ·  · 5m read
 · 
[Cross-posted from my Substack here] If you spend time with people trying to change the world, you’ll come to an interesting conundrum: Various advocacy groups reference previous successful social movements as to why their chosen strategy is the most important one. Yet, these groups often follow wildly different strategies from each other to achieve social change. So, which one of them is right? The answer is all of them and none of them. This is because many people use research and historical movements to justify their pre-existing beliefs about how social change happens. Simply, you can find a case study to fit most plausible theories of how social change happens. For example, the groups might say: * Repeated nonviolent disruption is the key to social change, citing the Freedom Riders from the civil rights Movement or Act Up! from the gay rights movement. * Technological progress is what drives improvements in the human condition if you consider the development of the contraceptive pill funded by Katharine McCormick. * Organising and base-building is how change happens, as inspired by Ella Baker, the NAACP or Cesar Chavez from the United Workers Movement. * Insider advocacy is the real secret of social movements – look no further than how influential the Leadership Conference on Civil Rights was in passing the Civil Rights Acts of 1960 & 1964. * Democratic participation is the backbone of social change – just look at how Ireland lifted a ban on abortion via a Citizen’s Assembly. * And so on… To paint this picture, we can see this in action below: Source: Just Stop Oil which focuses on…civil resistance and disruption Source: The Civic Power Fund which focuses on… local organising What do we take away from all this? In my mind, a few key things: 1. Many different approaches have worked in changing the world so we should be humble and not assume we are doing The Most Important Thing 2. The case studies we focus on are likely confirmation bias, where
 ·  · 2m read
 · 
I speak to many entrepreneurial people trying to do a large amount of good by starting a nonprofit organisation. I think this is often an error for four main reasons. 1. Scalability 2. Capital counterfactuals 3. Standards 4. Learning potential 5. Earning to give potential These arguments are most applicable to starting high-growth organisations, such as startups.[1] Scalability There is a lot of capital available for startups, and established mechanisms exist to continue raising funds if the ROI appears high. It seems extremely difficult to operate a nonprofit with a budget of more than $30M per year (e.g., with approximately 150 people), but this is not particularly unusual for for-profit organisations. Capital Counterfactuals I generally believe that value-aligned funders are spending their money reasonably well, while for-profit investors are spending theirs extremely poorly (on altruistic grounds). If you can redirect that funding towards high-altruism value work, you could potentially create a much larger delta between your use of funding and the counterfactual of someone else receiving those funds. You also won’t be reliant on constantly convincing donors to give you money, once you’re generating revenue. Standards Nonprofits have significantly weaker feedback mechanisms compared to for-profits. They are often difficult to evaluate and lack a natural kill function. Few people are going to complain that you provided bad service when it didn’t cost them anything. Most nonprofits are not very ambitious, despite having large moral ambitions. It’s challenging to find talented people willing to accept a substantial pay cut to work with you. For-profits are considerably more likely to create something that people actually want. Learning Potential Most people should be trying to put themselves in a better position to do useful work later on. People often report learning a great deal from working at high-growth companies, building interesting connection
 ·  · 17m read
 · 
TL;DR Exactly one year after receiving our seed funding upon completion of the Charity Entrepreneurship program, we (Miri and Evan) look back on our first year of operations, discuss our plans for the future, and launch our fundraising for our Year 2 budget. Family Planning could be one of the most cost-effective public health interventions available. Reducing unintended pregnancies lowers maternal mortality, decreases rates of unsafe abortions, and reduces maternal morbidity. Increasing the interval between births lowers under-five mortality. Allowing women to control their reproductive health leads to improved education and a significant increase in their income. Many excellent organisations have laid out the case for Family Planning, most recently GiveWell.[1] In many low and middle income countries, many women who want to delay or prevent their next pregnancy can not access contraceptives due to poor supply chains and high costs. Access to Medicines Initiative (AMI) was incubated by Ambitious Impact’s Charity Entrepreneurship Incubation Program in 2024 with the goal of increasing the availability of contraceptives and other essential medicines.[2] The Problem Maternal mortality is a serious problem in Nigeria. Globally, almost 28.5% of all maternal deaths occur in Nigeria. This is driven by Nigeria’s staggeringly high maternal mortality rate of 1,047 deaths per 100,000 live births, the third highest in the world. To illustrate the magnitude, for the U.K., this number is 8 deaths per 100,000 live births.   While there are many contributing factors, 29% of pregnancies in Nigeria are unintended. 6 out of 10 women of reproductive age in Nigeria have an unmet need for contraception, and fulfilling these needs would likely prevent almost 11,000 maternal deaths per year. Additionally, the Guttmacher Institute estimates that every dollar spent on contraceptive services beyond the current level would reduce the cost of pregnancy-related and newborn care by three do