Package Managers need global hooks

(captnemo.in)

17 points | by evakhoury 4 days ago

8 comments

ozim 1 hour ago
```
  Every package install is checked against the threat feed and it raises an exception if we find something malicious being installed  
```
Ok big problem is lots of stuff installed for campaigns wasn't flagged in any feed. If maintainer access is taken over you still don't have any feed info, maybe it will be a bit faster to publish so if maintainer finds out.
Everyone is looking at NPM how bad it is or AUR lately. Those are "free for all anything can happen, any kid can publish" repositories and that's what you get.
No one looks at Debian and is saying "well maybe we should do what they do"...
[-]
- captn3m0 45 minutes ago
  Author here - people are definitely looking at other places. This just happens to be where the attacks are, and gets disproportionate attention as a result.
  Do you have examples of campaigns that weren’t flagged? Everything except xz had a 1 day window and Dependency Cooldowns are super effective against most campaigns for that reason.
  See papers at https://kokkonisd.github.io/ for eg.
jamesrom 58 minutes ago
This is the opposite of "do one thing and do it well" unix philosophy.
You don't need your package manager to invoke your hook. You need _your_ tooling to invoke your hook.
./safely-bump-deps.sh && npm install
Want it global? Use a bash alias.
[-]
- captn3m0 41 minutes ago
  Aliases and pre-hooks are nowhere near the guarantees you want, that’s what I am arguing - not everything is invoked from a blessed shell. Safely-bump-does.sh is also impossibly hard to write because you are replicating _all of the work NPM does in transitive dependency resolution_. Unless you are re-generating the lock file from scratch - it isn’t safe. Just updating package.json isn’t sufficient for eg.
- staticshock 55 minutes ago
  Arguably, npm does one thing, but it does it poorly.
drdexebtjl 3 hours ago
This sounds like a prime new vector for malware, ironically.
[-]
- scott_w 3 hours ago
  My understanding is probably not: the hooks are configured locally, not by other packages automatically, so you’d install and setup the pre-install hooks yourself to check the packages before install/update.
  Can it be exploited? Yes, anything can. But that’s not a reason to not do this if the overall result is better.
- self_awareness 2 hours ago
  And how a malware can use this if it's configured globally in a root:root owned config file?
  [-]
  - drdexebtjl 2 hours ago
    Not all package managers require root.
    But yeah, maybe through an exploit with a narrow reach. Once in, the malware can veto security updates and escalate to full control.
    [-]
    - self_awareness 2 hours ago
      With root, malware can reach out to UEFI anyway, and can do whatever it likes.
JKolios 41 minutes ago
1. There are 5 competing standards.
2. This is clearly unacceptable, so we've created the one standard to unite them all™.
3. There are 6 competing standards.
eqvinox 1 hour ago
System package managers (at least apt & portage) have a whole bunch of hooks. I guess this is talking about language package managers.
TFA is also a bit hazy on what hooks exactly?
[-]
- captn3m0 1 hour ago
  `PreInstall` mainly. But `PreFetch/PreBuild` also for source-repositories, such as AUR helpers.
  homebrew doesn't support hooks as a system package manager: https://github.com/ecosyste-ms/package-manager-hooks as an example.
  I think the packaging ecosystem is varied enough that this should be left for the package managers to decide. Yarn allows dependency resolution and WebFetch overrides in its hooks for eg.
YuechenLi 2 hours ago
This seems to be primarily a problem with NPM, since it's the only package manager that I know of that allows for package authors to essentially run arbitrary post-install scripts silently package install.
Shai Hulud/Mini Shai Hulud happened because of this obvious glaring hole in the system, they even had the script to download an official copy of Bun to spread itself in case the targeted machine has hardened their security. So, the real question is not what other security features does a package manager need, it should be: why does a package manager have the ability to let package authors run arbitrary scripts silently on other people's computer in the first place?
It doesn't really matter how good your security system is if the front door is left wide open for anyone to walk through.
[-]
- nightfly 1 hour ago
  > since it's the only package manager that I know of that allows for package authors to essentially run arbitrary post-install scripts silently package install
  Are you sure? I'm pretty sure .deb and .rpm packages both allow that
  [-]
  - YuechenLi 58 minutes ago
    >Are you sure? I'm pretty sure .deb and .rpm packages both allow that Learned something new today. Thanks.
    I think the other significant issue with the NPM ecosystem that makes it bad in particular is NPM's dependency management is genuinely the worst out of any package manager because of phantom dependency is the default: you can be using a package without ever knowing that you are using it because it is imported implicitly, and the JS ecosystem dependency is so weird that taking down a small package that that a major project depends on cripples the entire JS ecosystem, as shown in left-pad, and launching a cyberattack via npm can be as easy as putting malicious code in any small package that large, popular packages depend on and watch it propagate. This is not hypothetical, it has been done, repeatedly in fact, over the years.
    TS is a good programming language, however NPM is a security nightmare, and somehow the collective reaction of everyone depends on the JS/TS ecosystem seems like a shrug and "oh well, what can you do".
    [-]
    - captn3m0 36 minutes ago
      Package-level hooks are everywhere: https://github.com/ecosyste-ms/package-manager-hooks
      I wrote this in response to the recent AUR attacks. The problem isn’t really too many dependencies - it is that most users cannot be auditing everything they install and we need mechanisms that help users where they are.
      I audit my AUR pkg builds, and I would have likely caught any malware. But so would a Dependency Cooldown or a third-party threat feed. Package Managers should make it easy to build this tooling via hooks.
  - tetha 1 hour ago
    Both certainly do. My own hypothesis on why this isn't a more widespread problem is the speed, or lack thereof, of these ecosystems. By the time a package hits debian stable, it's usually been under scrutiny for a year or more.
- captn3m0 2 hours ago
  (Author here). It isn’t a matter of pre-install hooks. I don’t want known malware on my system irrespective of whether it runs at install-time or not. Pre-install hooks are going away in NPM, but we will have code injected in index.js next.
  Modern package managers are not amenable to letting another script override its resolutions, and that is what needs fixing.
- jiehong 2 hours ago
  I agree with your premise.
  I’d even say perhaps we need a fine grained permission system like Apple provides, but for clis, not just something limited to maintainers of package managers.
  [-]
  - sysguest 1 hour ago
    > perhaps we need a fine grained permission system like Apple provides, but for clis
    well deno has the stuff... but deno's not popular (yet)
- TZubiri 1 hour ago
  pypi/pip are also being hit by a supply chain epidemic.
TZubiri 4 hours ago
>Every package install is checked against the threat feed and it raises an exception if we find something malicious being installed.
So your solution is to reinvent signature based antiviruses, like Norton Antivirus and McAffee?
The problem with these 2000s approaches were that attackers could:
1- Fuzz their payloads so that they are never the same and they don't trigger detection.
2- Offload payload mechanisms so that your monitoring system needs to play cat and mouse. For example, what if the malicious code does wget https://IP/file, will you detect wget commands? Will you scan for whatever looks like a URL? Ok, what if they do "another_package_manager_like_flatpack malicious_package", will your scanner implement all package managers? What if they construct the url? "protocol + "://" + domain + file" surely your global hook thing will notice that is a url and how it is downloaded and inspect those contents as well?
3- The attacker can control the timing and infect every user at the same time, especially if they control the update mechanism of users whose security policy is to keep things patched. Even if the malicious update is not simultaneous, the malicious update can start distribution, and the attack only triggered months later (simultaneously) when enough users have downloaded it (beating latency policies).
The only solution is to do actual work and either write the thing you are trying to offload to the 'open source community, or to actually write it yourself. But of course more work is going to be put into the possibility of a magical easy solution, than on an deteriministic hard solution.
[-]
- captn3m0 1 hour ago
  (Author here). I don’t really care _how and what you decide to do with it_, the post is about package managers giving users the ability to decide.
  Dependency Cooldowns can be implemented with global hooks, git-commit-signing checks can be implemented, LLM-scans can be implemented, someone can run the code in a jail and use the eBPF logs to publish a threat feed.
  Modern language packaging is also _source available_, and we have a huge leg up over traditional virus scans - we have the source code almost always. You can do amazing static analysis.
  Yes, it’s hard work. But package managers are doing it already. Yay and Paru both now support hooks. I’m offering to help for AUR to publish more metadata: https://lists.archlinux.org/archives/list/aur-dev@lists.arch...
- oefrha 3 hours ago
  That’s just a wall of text for “malware detection is hard, write everything yourself, don’t use third party”. Thanks for the insight, I guess.
  [-]
  - TZubiri 2 hours ago
    >Malware detection is hard Hell yeah
    >Write everything yourself, don't use third party
    No, you are exaggerating my point of view so that it's easier to dismiss and so you don't have to evaluate the proposition.
    A mix of a Strawman and a false dilemma.
    "Write more and use less third party, than you are currently using." would be more accurate.
    Consider this, the package manager I use has not been infected in over a decade, the package manager you are suggesting improvements for is currently distributing malware as we speak.
    Doesn't that invite you more to learn about our ways? It takes effort, especially if you consider what I'm writing to be a wall of text. But unless you consider 'shipping faster' to be a worthy tradeoff for cybersecurity, then it's worth it to learn, no?
    [-]
    - weinzierl 2 hours ago
      "Consider this, the package manager I use has not been infected in over a decade [..]"
      Which package manager do you mean?
      [-]
      - lelanthran 26 minutes ago
        >> "Consider this, the package manager I use has not been infected in over a decade [..]"
        > Which package manager do you mean?
        Apt, rpm.
        Anyway, don't you think that this line:
        > Which package manager do you mean?
        Gives off "'No way to prevent this problem', says users of only package manager that has this problem" vibes?
- self_awareness 2 hours ago
  These are not 2000 approaches, these are approaches used today (signature based detection).
  The difference is that in 2000s the signatures were written by hand and described static file info, today they're often autogenerated and describe the system behavior, either by looking at one executable, or a whole network of computers. But it is still signature based detection. Since they describe the program behavior, not the program structure, then if the program itself stayed the same (the sequence of system api calls stayed the same), no runtime packing/obfuscation makes a difference to a signature. Unless obfuscation changes the behavior.
  Also security is not binary, it's layered. Sometimes we can address an attack vector by using multiple levels. And sometimes it's simply worth checking for low hanging fruits if only to make the attack more expensive. The "cat and mouse" game is always about the cost of attack and cost of defense, if we raise one then we win in this area, unless the other party finds a way of lowering the cost of their side. Or unless they pay an unexpected amount of cost, for example in state sponsored malware.
  By the way, some security solutions also have actual parsers for example for PowerShell, so they can actually detect string concatenation that constructs the URL.
  [-]
  - TZubiri 10 minutes ago
    >Also security is not binary, it's layered. Sometimes we can address an attack vector by using multiple levels. And sometimes it's simply worth checking for low hanging fruits if only to make the attack more expensive.
    Defense in depth and multiple redundant layers is a legitimate technique.
    But it should usually be 2 virtually unbreachable layers, like 99.999% effective, because there may be a memory leak somewhere and it will be exploited once per year or so. Because 1 layer is just shy of 100%, 2 layers is already overkill, we would do 1.1 layers if possible (but of course it's not)
    Maybe if it's a multibillion dollar product you'd add a third or more impenetrable layers.
    And once you have those two redundant security layers, then you add deterrents and monitors, if you want.
    But relying on security measures that will maybe catch 20% of attacks or 40% of attacks as a main mechanism will lead to what some have described as swiss-cheese models. https://en.wikipedia.org/wiki/Swiss_cheese_model
    So yes, sometimes you can add a cheap measure that might make things more difficult to an attacker, on top of your main line of defense, then do it. But keyword cheap, if it takes 2 minutes like using a custom port, yeah for sure do that, but if you have to write an article and invent a new theory of hooks, then it's not it.
    Antivirus techniques are not used by serious development professionals that I know of, I've seen it used in IT contexts were there's non technical users that might download stuff, but even then it's not really a central technique, just something that security firms sell to non-technical execs. And it's in the context of sys admins trying to control the security environment of hundreds of employees whose actions they cannot control. In the context of developers who have to ensure the security of systems they have code for, you do not need these faith based techniques, you are importing the code, you have access to it, and you have the time resources to inspect it. These antivirus technologies are used either by personal end users that don't have the ability to verify the software or audit its source, or for IT departments that cannot even inspect the contents and packets of the software their employees are using for security reasons.