We need to preserve data. The FBI is trying to kill data.
We can not allow the FBI to work for Evil here. I actually think there should be a human right to data. With that I mean, primarily, knowledge, not to data about a single human being as such (e. g. "doxxing" or any such crap - I mean knowledge).
Knowledge itself should become a human right. I understand that the current law is very favourable to mega-corporations milking mankind dry, but the law should also be changed. (I am not anti-business per se, mind you - I just think the law should not become a tool to contain human rights, including access to knowledge and information at all times.)
Wikipedia is somewhat ok, but it also misses a TON of stuff, and unfortunately it only has one primary view, whereas many things need some explanation before one can understand it. When I read up on a (to me) new topic, I try to focus on simple things and master these first. Some wikipedia articles are so complicated that even after staring at them for several minutes, and reading it, I still haven't the slightest clue what this is about. This is also a problem of wikipedia - as so many different people write things, it is sometimes super-hard to understand what wikipedia is trying to convey here.
> We can not allow the FBI to work for Evil here
Historically speaking I can't see this as even being in the top 100 evil things the FBI has done.
> Historically speaking I can't see this as even being in the top 100 evil things the FBI has done.
Perhaps, but we can't change the past: we can only fight against what is happening in the present to try to get a better future.
Probably a bit of a 'baby with the bathwater' situation here. At almost no point has that institution been a net positive - at times snooping on 'political dissidents' (like MLK Jr.), and at others bungling cases so bad they become moments of national shame (Ruby Ridge).
You're never going to get a system with a clandestine domestic service running ethically for long, esp. not with qualified immunity. It's simply too attractive to dumb psychopaths with delusions of grandeur and concurrently not of interest to people with a strong sense of community or morals.
> At almost no point has that institution been a net positive
Hard to measure, isn't it. In the eyes of the millions of americans who have at some point in their life been victims or related to or friends of victims of some kind of serious crime, the FBI has often times been helpful and/or the prospect of being caught has been a deterrent for crimes.
You contrast that with all the bad that has come from there, of which there is surely plenty, but how come you claim thay the bad obviously must outweigh the good?
> At almost no point has that institution been a net positive
The FBI's anticorruption work is good and necessary.
I assume that’s why the original argument is that it’s not been a net positive. I.e. the assumption is that lots of work can be good and necessary, while even more that is evil and excessive can end up with a net negative.
You really believe it's never been a net positive?
Have you ever worked in LE, in any capacity?
What a laughable thing to say.
Have you ever dealt with US law enforcement? They are a joke. Thinking they are a positive influence is a joke to me.
I'm sure police (when they aren't fighting over jurisdictional issues) find it helpful, that doesn't mean that it's helpful for the population, especially when it (and police generally) are used as a tool for domestic influence operations and to basically shunt some people aside in the name of business and landowners.
Ok then. This, while bad, is not even in the top 50 of evil deeds the FBI is CURRENTLY doing ...
List not fifty, but just ten of those, I'd like to know.
It being #101 or #275 in a ranking doesn’t make it not evil though.
Well, being ranked #275 on the list of 5 things I'm going to care about today means that I'm going to go "hmm that's interesting" and then move on with life.
We shouldn't let people get away with doing minor bad things just because they're not as bad as other bad things we let them get away with.
The tendency to do evil things should be the overriding concern, not the relative ranking of them.
No, but it can make it unworthy of concern, same how an egg gone bad on the kitchen counter is not very important when the house is on fire
If one is naive to the fire in the house, the egg on the counter, as something worthy of concern, might get them to look around and see that their house is not actually suitable for use as shelter. People who are trying to put out the fire (or who are simply concerned about it while watching from a distance) might decide to point to the spoiled egg to spread awareness of the fire to the people inside.
An egg gone bad and a house fire are not the same sort of concern. A government agency doing two types of evil thing is more akin to ranking whether the small fire in the living room is more important than the slightly larger fire in the media room. It’s all fire in your house.
And?
the phrasing makes it sound like historically the FBI does not work for evil. which is somewhat annoying to someone who believes the FBI has been primarily a tool of evil.
I just wish the default way people used archive.is was to generate their long form link instead of their short link, as, if the site ever goes down, all of the links people have posted where they don't change the setting and thereby paste the default inscrutable code link will be destroyed... building a service with a pernicious behavior like that is ALSO not okay in its own way.
where can you change the setting? I was not aware of this.
At the top click "share" and then select the "long link".
http://archive.today/2023.11.30-020758/https://www.theverge....
> Wikipedia is somewhat ok, but it also misses a TON of stuff, and unfortunately it only has one primary view, whereas many things need some explanation before one can understand it.
Last I checked, they had archive.is blacklisted; the people with power there had (as far as I can tell) come to the conclusion that people using that site to prove that websites had stated X on date Y were the bad guys. Of course, they still have archive.org sources everywhere, so the objection is not actually to archiving page content.
Tons of claims also seem to be sourced ultimately to thinly-disguised promotional material (e.g. claims of the prevalence of a problem backed up by the sites of companies offering products to combat the problem) and opinion pieces that happen to mention an objective (but not verified) claim in passing.
> they had archive.is blacklisted
What do you mean by this? Wikipedia actively encourages people to use archive.is links in citations:
https://en.wikipedia.org/wiki/Help:Archiving_a_source#Archiv...
Well, it was last I checked, which may have been some time ago (I couldn't easily find the changes with the WikiBlame tool).
That would be 7 years ago. It indeed was blacklisted on 2014 but restored on 2016.
2016 was 9 years ago, but thanks for the heads-up.
The difference is that we know who's running archive.org. We don't know who's running archive.is. That's perfectly fine for private use but unacceptable for a site like Wikipedia.
It's not that difficult.
Are there examples of archive.is falsifying information? I care more about "what you do" than "who you are".
I never said there are examples. However, "who you are" matters, even if you don't care. At least when the who is known, we can guage the trustworthiness and what bias exists, because there's always a bias. When who is not known, you don't know what bias to account for. That's not trustworthy and not reliable. And when the site is closed source and you have no idea how it's being run, nor by whom, you don't know "what you do" either.
When you don't know who runs archive.is, you can also never know if he sold it to someone else (except if either party publically announces it).
This type of possibility really worries me. Archive.is is much closer to actual history in many ways. If the data there starts getting corrupted or biased, there’s no way to know if what was truly there.
The idea that the permanent record of the internet could hinge on the ethics of one stranger behind a server rack is deeply unsettling.
This happens on .gov websites a lot now, and it is deeply unsettling.
Knowing the current owner of archive.is doesn't help; we need more full, independent Internet mirrors that can be compared against each other.
And when the information on archive.is starts getting corrupted, those links can be adjusted or removed.
Who will catch it, when and how?
Does wiki only allow people to vote sources where the owner is known?
"wiki"?
I never concluded that but this actually allows someone, the anonymous here to change the history/info backwards if needed. For russians as an example this would be powerfull tool to manipulate narrative, which is cultural there at this point. Pretty smart and dangerous if it is really operated by them.
Such practices are by no means limited to Russia.
Absolutely. They we're just example which came to my mind first.
Ignoring the fact that one of this service's primary reason for existing is that corporations and governments are already abusing their ability to retroactively change history.
If this is truly a concern then the answer is to have more than one publicly-accessible independent archive service. Archivebro has never taken any steps towards securing a monopoly on archiving things. The FBI are the only ones doing that.
Also not everybody in Russia is on the FSB payroll. News media always stops investigating as soon as there is credible information that somebody or their server is located in Russia because if they learn too much then it becomes difficult to discredit them as "possibly being linked to the kremlin". If you used any other nationality to imply that somebody is acting in bad faith on behalf of a hostile foreign government without additional evidence those same journos would call you a racist and try to get you canceled.
> We don't know who's running archive.is.
I always assumed it was officially sanctioned by the Israeli government, for some unspecified (but almost certainly nefarious) purpose.
Not sure if your assumption was based on the TLD, but `.is` is Iceland, not Israel (which is `.il`).
Ty to everyone who pointed this out. I have a feeling I've googled it several times, and then promptly forgot. Maybe it will stick this time.
.is is Iceland, not Israel (that's .il).
Why Israel? From the TLD? IS is Iceland. Israel would be IL.
Where did you check this? While neither are listed on WP:RSP, I know many cites are changed into web.archive.org links once they go down.
I heard stories of incriminating stuff for higher-ups disappearing from archive.org.
I heard stories about a potential Oracle data breach (I think mainly affecting their customers) being removed from Archive.org too. It’s because in general, they comply with requests to remove stuff, which is understandable from an ethical perspective. But do they at least try to explain the reason for the takedown? Is it just not feasible to do that?
Archive.org honors robots.txt retroactively. So anybody can take down their own stuff by adding a link to their robots.txt file.
This is no longer true. They changed their policy to ignore robots.txt in 2017. I seem to recall that they still respected robots.txt later, though I can’t find any more information on it and may be misremembering. Currently, they do not.
Does it mean archive.org works for any sites?
My main use for archive.is is for sites that somehow cannot be archived (a message will show up mentioning this site cannot be archive or something along these lines).
archive.is is generally pretty good in forcibly attempting to get an archive, if the HTML doesn't work, the screenshot will work fine. Although archive.is doesn't seem to handle gifs/videos.
> Does it mean archive.org works for any sites?
They respected exclusion requests after they stopped to respect robots.txt. I don't know their policy for new exclusion requests.
Oh. Did not know that.
> Last I checked, they had archive.is blacklisted; the people with power there had (as far as I can tell) come to the conclusion that people using that site to prove that websites had stated X on date Y were the bad guys.
Or they're worried about the paywall by-passing functionality (which is probably what a good portion of people use it for) and copyright claims against archive.today potentially having it taken down and thus breaking a lot of links.
I agree. Knowledge should belong to all of humanity.
But then also don’t be angry at big corporations when they scrape the entire internet.
There's no contradiction in wanting an abolition (or at least substantial curtailment) of copyright while also being upset that mass violations of copyright magically become legal if you've got enough money.
Enforcement being unjustly balanced in favor of the rich & powerful is a separate issue from whether there should be enforcement in the first place—"if we must do this, it should at least be fair, and if it's not going to be fair, it at least shouldn't be unfair in favor of the already-powerful" is a totally valid position to hold, while also believing, "however, ideally, we should just not do this in the first place".
> There's no contradiction in wanting an abolition (or at least substantial curtailment) of copyright while also being upset that mass violations of copyright magically become legal if you've got enough money.
Why can't you just be happy for those few who are lucky enough to be able to violate copyright with no consequences? Yes, I know you'd want everyone to be able to violate copyright, but we're not there yet.
"Why can't we just be happy" that individuals and smaller companies get sued into oblivion over copyright violations, while large AI companies can scrape everyone's data and use it for training and completely ignore copyright while generating code and images and text and music based on all that that displaces the demand for the originals? Is that what you're asking?
Because we’d like the powerful to feel the crunch from bad law rather than get a backdoor, so they have to use their power to change things for everyone instead of just getting it changed for themselves.
More often than not the rich just codify the "backdoor" for themselves in such case. A rich man can buy the $30,000 registered machinegun and pay the $200 NFA stamp and be 100% legal, the poor man who 3d prints a $0.50 of plastic to do the same thing goes to jail for 15 years.
He might be happy for them and also sad because there is no rule of law.
Its the rule of bribery quite frankly. Name it lobbying and nobody bats an eye on it.
The entities training AI are not anti-copyright, or anti-intellectual-property. If I were to steal their AI models they would sue me into the ground and probably win. Furthermore, even if you are anti-copyright, you probably still don't want your shit scraped by AI trainers since the bots are extremely aggressive, almost like a bona fide DDoS attack.
AI is not an attack on copyright, it is an attempt to replace it with something worse.
You're assuming way too much with "not there yet". The point is the corpos will violate copyright with impunity today, and then in a few years sign a bunch of settlement agreements and pull the ladder up behind them.
I'd love to see copyright slowly become irrelevant, but even with that goal we should expect to see large corpos being the last to stop respecting it.
While it's true people are upset at AI companies profiting off of artist creations with no compensation, I know a lot of people are also reacting to how the recent AI companies have been scraping the web. The reason folks are using Anubis and other methods is because unlike Google which did have archiving of sites for a long time (which was actually a great service), these new companies do not respect robots.txt, do not crawl at a reasonable rate (for us, thousands of hits a minute from their botnets - usually baidu/tencent, but also plenty of US IPs), hit the same resource repeatedly, ignoring headers intended to give cache hints, stupidly hitting thousands of variations of a page when crawling search results with no detection that they are getting basically the same thing... And when you ban them, they then switch to residential ranges. It really is malicious.
> AI companies profiting
Are they?
If you boil it down to the AI companies are making money (subscriptions, etc.) based on content they did not pay to produce, then they are profiting from someone else's hard work.
Revenue is not profit.
I didn't say it was. I understand that profit = revenue - cost.
I said they're profiting from other people's hard work, a separate concept.
profiting != profit
'stealing is fine if you lose money when reselling'
I don't believe I wrote anything of the sort.
Do you use google search for work? Then you are profiting from someone else's hard work!
The difference is that when you search on Google - at least before AI overviews - you ende up at the source site.
Also Google respects robots.txt. Every site that Google surfaces chose to be in the index.
Thats not entirely true. Google might or might not hide your pages from index. They'll definitely going to scrape it anyway. They also display summarized info from your page (famous "what is scrapping" joke showing wikipedias summary). Finally, you might just get your answer without visiting - just by skimming result description.
People at these companies are receiving a salary to do these things that the person you're responding to is opposed to.
While not all the companies in question may or may not be profiting from these things some of them are, and most if not all of their employees certainly are as well.
A lot of the outrage isn't at scraping, it is at the disruptive techniques used to do so. Like web-scraping whole websites that already provide convenient images of their content for download.
Feels like now we're just redefining our rules so that the people we don't like are out and the people we like are in. Does the content creator have the right to determine how their work is used or not?
I have a right to my copyrighted work, and I also have a right to set and enforce access rules to a server I operate to grant people access to it.
I don't care that they scrape my website.
I DO care that nearly 2M different IPs are used to try to pull 42k commits out of a git repo one by one when they could just git clone it ...
I wish the companies would just pay a few technically-competent companies to do the scraping. Pay two so you can check their work, maybe, but let's get past the point in time when dozens (or more?) of companies are all simultaneously hammering the web.
There’s perfectly good LLM’s built specifically to shit out swaths of mediocre code to do that, why would you pay anyone?
This is a false equivalency I'm surprised no one else has brought up. An archive of a site preserves attribution inherently, the scraping and training are not.
Is it? I thought it was ridiculous at first, but the more I think of it... both are scenarios where a corporation is scraping billions of webpages. We like the reason archive.is does it, but unless it's some kind of charity, I think it's a reasonable comparison.
archive.is is a charity no? Or at least they take donations, it seems the legal entity behind it is nebulous, but they don't have ads and have no paid product or offering.
They sure as shit do have ads. Have you ever accidentally followed a link using a browser profile that has no ad blocking enabled?
I only rarely browse without some form of content blocking (usually privacy-focused... that takes care of enough ads for me, most of the time). I keep a browser profile that's got no customizations at all, though, for verifying that bugs I see/want to report are not related to one of my extensions.
Every once in a while, I'll accidentally open a link to a news site (or to an archive of such a site) in that vanilla profile. I'm shocked at how many ads you see if you don't take some counter measures.
I just confirmed in that profile: archive.is definitely puts ads around the sites they've archived.
So if OpenAI or <AI scraper of the day> adds attribution to their AI-generated answers, everything is OK?
It would be closer to okay.
One thing is not the other. A corporation is not a human (and no I don't care what Citizens United says). A corporation has no inherent rights.
Big corporations aren't humans.
Corporations large and small don't do anything. It's always a person. The question you are answering, even if you don't think you are, is whether a few people can get together and act in concert and still retain their rights.
they are persons under US law.
Only in a couple of very specific and narrow ways. They are not considered persons generally under US law. They are legal fictions that have been granted a subset of rights that people have.
And that subset of rights keeps expanding.
It would solve a lot if that was taken to the extreme. Sorry Amazon, but your working conditions killed five people. Your business licens is going to jail for 40 years, good luck getting contracts with other companies with murder on your records when you get out.
US law only applies in the US. Plus, the company in question seems to be based in Canada, so outside the FBI jurisdiction
So was Kim Dot Com. Biden went after him anyway at the behest of big media.
I mean, its not like it was just Biden. His extradition proceedings took place during three different US presidential administrations. You might as well include Trump and Obama in there as well.
that's just wishful thinking. US law applies world wide as long as Trump is willing to reach out and nab you. ask the Venezuelan fishermen.
But US law isn't even the law of the world, let alone the definition of reality.
How are you planning on doing anything about it?
Does it need to be? Trump has been assasinating people in boats with no evidence whatsoever.
Hot take here, I know, but some of us believe the law should treat large corporations differently than it treats individuals when it comes to their rights and privileges.
> But then also don’t be angry at big corporations when they scrape the entire internet.
I'm only angry with them when they pay hush money to IP extortionists.
It's not that they're scraping the internet, it's that they're scraping the internet, profiting off the data they take, and still using the copyright regime to go after others who do unto them.
This seems like an incredible disingenuous take. There's a marked difference between collecting information to freely share with the rest of humanity, and collecting information to feed into algorithms under the guise of "artificial intelligence" with the pretense of enriching their finances and putting others out of work.
That's a bad take, just like open source code is available to all, it's not the case you can always resell it or repackage it for your own profit.
Information can be made available to all, and at the same time, we can make it so others cannot resell or repackage it for profit like what AI companies are doing.
Well, as long as they pursue a "copyright for me but not for thee" regime, you can.
> I actually think there should be a human right to data. With that I mean, primarily, knowledge, not to data about a single human being as such
How do you suggest we fund the difficult work needed to investigate, research, and produce such data?
Remember that facts are not copyrightable, and as such, can't be restricted by copyright. Creative expression of those facts, on the other hand, can be.
I've been thinking the Library of Congress should buy Archive.org
It should become part of an entity that is very difficult to kill, and will exist for a long time.
Although I guess that's a function of culture, and I think respect for libraries is rapidly declining.
If the last 9 months have shown us anything, it's that long-running government institutions are a lot easier to kill than we thought. And the idea of archive.org being under the control an administration like the current one in the US is pretty frightening. They would have absolutely zero qualms about deleting and changing that data.
Ditto for the other side of the aisle. We still don't know who was really the president, while everyone pretended Biden was not a dementia patient.
I’d rather not rely on a government owning it that has been very open about their desire to control what people perceive as true
> The FBI is trying to kill data.
For you. I'm sure they love data as long as only they can access it.
> We need to preserve data. ... I actually think there should be a human right to data.
I'm not going to simp for the FBI here, but come on: do you have a human right to preserve my private photos leaked by a stalker or a hacker? Because archive.is is famously unwilling to play nice here.
I don't know if this case is about that, or about pirated content, or about the administration trying to scrub something embarrassing off the internet. But the fact that archive.is cheerfully enables all three "use cases" should probably give you a pause.
It's a delicate line to walk because takedown processes can be abused to do things we don't like. But "lol, tough luck, information wants to be free" is not a sensible blanket response in a polite society.
"I actually think there should be a human right to data."
Interesting tagline, but probably has far too many side effects.
If it were me, I would try to boil this down to some negative right, those usually have less side effects, and even then I would be very careful.
> We can not allow the FBI to work for Evil here.
It's not up to us to tell the FBI what to do, that's a fatal misunderstanding about how power works. You can demand to see the FBI's manager, but I doubt it will get you anywhere. You can choose between two candidates offered by the privately owned and run political parties for whom the FBI works, but I don't think that will help either.
> Knowledge itself should become a human right.
Human rights are created by legislation. Unless you own a legislator (or rather, many legislators), you will not be involved in this. The people who own (and parcel out) knowledge itself, however, will be involved.
It would be better if we stopped making pronouncements about what people more powerful than us should be doing. It's like prisoners talking about what the jail should be doing. You should talk about what you should be doing. And don't mistake demanding for doing, or walking in the street with your friends for activism (unless you're violating curfew and are prepared to defend yourselves.)
Be brave. Put forward a program that might fail. Ask people to help you with it, ask them to follow you, tell them where to show up. Join someone else and help with their program. Don't demand, then whine when they say "of course not." The FBI is not your daddy, and the people running it are not running it on your behalf.
I don't mean to be personal, but this type of talk is empty. The way how to do things is decided is through power; and the way weak people exercise power is collectively, through discussion and coordinated action. Anybody can talk about what they would do if they were dictator of the world.
All you are guarding against here is some bits in a machine. Knowledge can be embedded in other substrate, other medium. Acquired by more actions than reading social media.
IMO what you really mean is "I should be free to sit and surf the web secure in my belief others are acting properly, while subsisting on externalized labor that props up my biology".
Asimov and countless others highlight this difference between being a passive reader of others ideas as orthogonal to knowledge acquisition. If you aren't conducting the experiments you acquired nothing but memory of someone else telling a story.
4% in the US hunt now. So to get people living rather than acquiescing, all you office drones are going to have to learn your way out of helplessness. Go acquire knowledge of how to grow a potato.
You won't because you don't want to acquire knowledge. You want the world to gift you knowledge and experience through as little effort of your own as possible. Typical American capitalist. 8 billion across the globe aren't that impressed by 300 millions obvious grift.
We tried making knowledge free and available to every online. Capitalists came and gobbled it up to sell back to us as "AI". Unfortunately we can't have nice things with people taking advantage of it.
They pardoned the Silk Road drug lord to go after a copyright infringement-lord instead? It's not even in their effective jurisdiction, if this indeed is a Russian national. Don't they have more important Russian crimes to investigate?
I read there was a US government investigation tracking Ukranian children abducted by Russian forces, but supposedly there weren't enough resources [0] to sustain that.
[0] https://www.npr.org/2025/03/19/nx-s1-5333328/trump-admin-cut...
> They pardoned the Silk Road drug lord to go after a copyright infringement-lord instead?
The president’s pardons are not popular with the FBI and law enforcement. The FBI is not happy about doing all of the work to prosecute people only to have the president override it for political reasons.
I don't think it is political reasons, seems like it is for large donation reasons.
That is a political reason...
It convinces others that you're willing to pardon them too in exchange for money and convincing other people is the definition of politics.
That is not political, it is purely a service offered at a price. There is no specific political agenda behind these pardons (i.e., they don't pardon only folks who are, for example, Evangelicals or anti-immigration or whatever), the only criteria is payment.
> [1] Politics
> Politics (from Ancient Greek πολιτικά (politiká) 'affairs of the cities') is the set of activities that are associated with making decisions in groups, or other forms of power relations among individuals, such as the distribution of status or resources.
I don't know how much more obvious I can make this for you. Bribery is political.
> Bribery is political
Bribery is political. But it's not taken to be a usual part of politics in the West. (Similar to how the Roman word for ambush was the same as their word for treason. Treason isn't taken to be a usual part of politics. Ambush, for them, not a usual part of warfare.)
Basically, you're both right because what is and isn't political is itself a political question.
Word meanings evolve. Virtue literally means "manliness" in Classical Latin but only a pedantic dick would insist we use it in that sense. Polis and it's related words meant something different to the Greeks than they do to us.
Right, "Politics" evolved from "affairs of the cities" to "the set of activities that are associated with making decisions in groups, or other forms of power relations among individuals, such as the distribution of status or resources.".
it’s the current meaning he’s reciting he’s just adding etymological context
Selling pardons for money is inherently a very political act. It means that you are aligning yourself with moneyed interests, which is clearly the heart of Trumpist politics. Setting ideology aside even, the open selling of pardons sends the message to the moneyed interests in general that he's on their side, even if they don't need a pardon at this exact moment. It serves both practical (get rich people to like you and therefore donate money to your campaigns and causes to help them succeed) and ideological (supply-sider-esque doctrine going back to at least the protestant reformation says that rich people should be in charge because they're rich, QED) purposes.
The hypothetical pardon was promised by then candidate-Trump in a speech at the Libertarian Party convention.
The specific political agenda was to get support from libertarians, who lean conservative, but don't like Trump much - because he rejects libertarianism.
That's as political as you can possibly get. It wasn't a behind the scenes thing. It was literally announced at a political convention.
> they don't pardon only folks who are, for example, Evangelicals or anti-immigration or whatever
Who has Trump pardoned that wasn't a supporter of his?
I'd like for you to define "political" please.
> seems like it is for large donation reasons.
more like large bribe reasons.
In the US, this is much the same thing
Is there a difference?
I guess OP means to say it is not idealogical reasons.
Op means to say this type of pardon is not to meant to win votes or satisfy the demands of constituents, Like with convicted cops or people with weed related crimes etc or pardoning draft dodgers after Vietnam or civil war and so on .
While money is involved deeply in politics and financial corruption is there , occasionally idealogical (political) actions without direct financial benefits also happen.
It is hard to say whether this pardon of Silk Road founder was motivated by libertarian, or crypto community pressure or by financial donations to the party etc both are possible even at the same time but they are different considerations
> I guess OP means to say it is not idealogical reasons.
“Government exists for the personal benefit of the leader” (or simply “for my personal benefit as the leader”, with even less generalization beyond that) is an ideology, actually.
It’s not one that is popular to embrace publicly, but, that's hardly unique along real ideologies.
In 2019, Giuliani's assistant chided John Kirakou that pardons couldn't be discussed in his presence but that the fee was $1 million for Giuliani and $1 million for Trump. Given inflation, I'd bet that pardons now cost around $3 million.
> The president’s pardons are not popular with the FBI and law enforcement
Well if they don't like it I'm sure he would be happy to start a bidding war.
You are talking about Kash Patel's FBI. The guy who has a hit song and book called "The Plot Against The King" pretending the 2020 election has been rigged and who maintains it to this day.
The FBI does what Trump tells them to do, that's it.
> The president’s pardons are not popular with the FBI and law enforcement.
This is an unverified blanket statement. We don't know what percentage of the FBI and law enforcement agree or disagree on anything.
Source? Any of them still employed?
They got me—a copyright infringement lord—too. The FBI profile assigned to me even wrote in a case study that the FBI thought I was making millions, amongst other misses.
Their priorities are highly political.
Would love to hear more about this if you're inclined to share or have written about it somewhere. Legal contacts betweent he fedreal government and individuals are often surreal.
The case study: https://ieeexplore.ieee.org/document/10628922/
Appreciate it. Will read both of these this evening.
> It's not even in their effective jurisdiction
Like that ever stopped USA before...
It’s currently not stopping the US federal government domestically.
Shouldn't that be an Interpol or UN responsibility? Why is the US tracking foreign children?
The US gov doesn’t even care about copyright infringement, just in the cases where big companies are inconvenienced by it and it’s done by an individual / small company instead of a mega AI corp swallowing up all copyrighted content to vomit out their own spin on it through algorithms.
Federal investigations tend to only go after big fish yes.
The root problem is the IP laws that congress passed. There will always be large pressure on law enforcement from the industry if you give them that leash.
there's no lordship because afaik there's no direct profit
> Silk Road drug lord
Oh please. Ross was no saint by any stretch and it does look like he may have made a very dark decision at one point, but it didn't happen in a vacuum. There's a mountain of details and nuance around that case, including a whole host of law enforcement abuses that many people would find distasteful if not sickening if they actually got the whole story.
>it does look like he may have made a very dark decision at one point
very vague. care to expound?
Entrapment. The FBI posed as a user and convinced Ross that some people needed to be taken out, offered to do it, arrested him, dropped the murder-for-hire charges because they didn't want to play that game in court, knowing it would backfire, while still using those (unconvicted) charges to publicly smear him and influence the judge's decisions, and finally stole his Bitcoin.
Two agents went to prison over this. Those same agents have a history of fraud and abuse.
Trump pardoned Ross largely to buy the (big L) Libertarian vote. It was announced at his speech with the Libertarian Convention.
Not for any ideological reason.
It's not the libertarian vote that he cared about in particular so much as it was the firehose of crypto money that was supporting free ross.
Both are true. For a long time the Libertarian party was seen as drawing away small numbers of protest votes from the GOP, being populated by (mostly) guys who rejected Democratic over-regulation and nanny-statism but also rejected the GOP's anti-abortion politics, criminalization of drugs etc.
A few years ago there was an organized effort to capture key roles in the Libertarian party and focus the organization more on property rights and capitalism, with less emphasis on personal freedoms and constitutional limitations on government. This effectively split the Libertarian party, neutering it as electoral factor.
Now, the Libertarian party never mustered a large share of the vote, but many electoral contests are won at the margins. They managed to get ~3% of the vote in 2016, but lost >80% of that over the following 2 elections.
https://www.the-pechko-perspective.com/political-commentary/...
https://www.splcenter.org/resources/hatewatch/libertarian-pa...
https://en.wikipedia.org/wiki/List_of_United_States_Libertar...
This seems more likely - how many libertarians are there in the US? Surely there are much larger groups you can appeal to if votes is what you're after
There are dozens of us. And none of us can agree on anything.
Libertarians are like independents except noone wants to try to win us over
The Libertarian party got ~4.5 million votes in 2016. Getting some of those votes, or dissuading them from voting, is enough to make a difference in a tight race. See my other answer upthread for more context.
I don't think the crypto crowd was ever at hazard at not voting for Trump, so I'm not sure what the advantage would have been with respect to them. However, the libertarian crowd was.
As a libertarian voter, the pardon for Ross was the only thing Trump did that actually brought me pause. To the point, I felt immensely guilty for not voting for him when I voted (L) because I knew[thought] I was damning Ross to a jail cell. It weighed on my conscious for a long time after the vote, an it wasn't until Trump won I felt somewhat absolved of the guilt.
My personal opinion regarding the Ross pardon is that the Libertarian Party sold its soul for a donut. They could have gotten way more out of Trump than pardoning one particular Internet drug dealer.
Internet drug dealer doesn't really concern me, but he tried to have someone killed!
Look, when Jeffrey Bezos and Larry the Lawnmower ask Trump for an FBI investigation and send him over another solid gold turd or whatever bribe is fitting for such a request, they expect results.
As someone who has been the target of an FBI investigation for what was effectively criminal copyright infringement (later arrested and did time in prison), my only takeaway is that this, if anything, should just be a civil suit just like so many other similar cases of copyright issues.
In my personal experience, the priorities of the FBI are typically highly politically motivated. The exceptions are if you’re doing something seriously icky, or doing fraud that deceives people.
For those interested in what’s reported and what actually happens, I’ve made some comments on my case and my experience here: https://prison.josh.mn
FBI wants to remind everyone that only US mega-cap companies can scrape the entire internet, not share the data, use it to train AI models and then charge people to use chatbots that use this 'laundered' data. Anybody else attempting to do this is a criminal in their eyes and must be punished.
Archive should just rebrand as an AI start up then offer an 'llm' that is suspiciously 'over-trained' and happens to spit out the site you query exactly... Copyright infringement? Nay! Over-training! "A fix is coming soon™!"
I pay subscriptions to some of these sites and still use archive.is on them because it is a more pleasant reading experience. No auth failures, no annoying popover windows begging me to subscribe to their dumb newsletter. Just the internet equivalent of a static piece of newsprint.
my personal theory is that archive.is has paid subscription accounts (legit or via botnet) to most of the major news outlets and edits the html to make the sites look not logged in. I wonder if they do it by hand or by doing something like : https://github.com/pirate/html-private-set-intersection
in my experience it's just a headless browser with a bypass-paywalls extension
Same, I also use links from HN threads and I donated to archive.is in the past. I don't want them gone.
I used to do the same with Lynx but enough websites have now broken it.
ublock with annoyance filters also solves this
Is there any more annoying popup than the newsletter popup? I'd rather see a targeted ad than that BS.
NO! I do not want your newsletter! I wouldn't even have an email address if it wasn't absolutely required to operate in society today. The less email I get, the better!
Email is becoming like fax machines: An old, dated technology that refuses to die.
Personally I don't mind an offer to subscribe to the newsletter but Substack is way to aggressive. They show the prompt even before I have finished the article (How do I know if I want to subscribe?) and obscure the article (actively working against what they know what I am trying to do). So I now just immediately back out when I see that. I won't visit sites that are purposely harming the experience.
Physical feels that way to me sometimes. In the US, I get assaulted on a constant basis by mailers and ads for things I never expressed any interest in. Waste of time, waste of paper, waste of resources.
When there are a few simple nice things making our lives a little bit more bearable, there are always other zealous assholes desperate to ruin that.
Here I speak about this site, but everyday we have new cases of that. Like "new tax on anything that starts to be popular" for France, or Google trying to kill our privacy and F-Droid by requiring all app devs to have attestation from them.
So interesting that with this link, I saw the whole article is a couple paragraphs. With the original link, I gave up after the second ad that almost covered my whole screen on mobile. Too many ads are a terrible user experience that doesn’t let us read anything.
I also resorted to using archive.is when I first visited the page so that I wouldn't need to agree to their data collection for personalization.
chef's kiss
how coincidental
The subpoena cites the following statute as authorization: "(1)(A) In any investigation of (i)(I) a Federal health care offense; or (II) a Federal offense involving the sexual exploitation or abuse of children, the Attorney General; or (ii) an offense under section 871 or 879, or a threat against a person protected by the United States Secret Service under paragraph Secret Service determines that the threat constituting the offense or the threat against the person protected is imminent"
One of the agents named in the subpoena appears to have previously worked on child exploitation cases years ago:
https://www.supremecourt.gov/DocketPDF/22/22-6039/245948/202...
Now that might be an interesting angle.
1. Put up CSAM on your unlisted domain briefly.
2. Archive page and delete site.
3. Send people archive link.
I can confirm that (something similar to that) is what regularly occurs to share CSAM.
Have dealt with content from when I was ~11 on Omegle appearing across the internet for years at this point (NCMEC is an amazing resource).
Archive sites are regularly abused by bad actors.
Here is real example on archive.is:
https://archive.is/https://ezgif.com/maker/*
I submitted multiple complaints to NCMEC but didn’t get results. Germany, though, was able to get the archives purged.
On the page, you will see the text:
> In response to a request we received from 'jugendschutz.net' the page is not currently available.
I think owner mentioned in a blog post (or on twitter?) this is indeed happening, but I forgot the exact wording to google it.
UPD Found this by googling "site:blog.archive.today abuse":
https://blog.archive.today/post/117011183286/yesterday-i-did... (2015)
That seems like something that should be handled with a simple takedown request and those behind archive.is would almost certainly comply. 99.999% of people using archive.is are using it to bypass news article paywalls nothing more. Which, if we're honest, is the real reason why the FBI is going after them.
Personal anecdote but I almost never use these archive sites to bypass paywalls. I only use it when I want to see how establishment news sites somehow sometimes accidentally tell the truth, then, when they get the call, they try to purge their original reporting. Again, it might be my personal bias, but in my opinion, this is the main reason they are going after them. Because these websites let people prove the hypocrisy and the lies.
thank you!
Is there a dump of all archive.is sites (similar to libgen dumps) in case it goes down, so it could be set back up again?
It is pretty sad that this is happening and that it apparently is at risk of just disappearing soon. I understand there are a lot of ethical concerns with that site, but if I use like the Internet Archive's Wayback Machine to try to save some specific documentation pages for certain proprietary software, it absolutely fails to actually save the content. So then it is just a bit more difficult to save a particular knowledge base article before it might get rewritten or updated.
not sure what's taking the FBI so long, to me it seems obvious: https://drive.google.com/file/d/1M6PMQrehmeuRU_KDd_PTKsTtVNN...
Surprised he's American. I hope he finds refuge but there's not many places that either don't have strong IP laws or don't have US extraditions.
Someone should make a site like archive.is that runs the saved page through an LLM to summarize the main points, and perhaps extract a few critical quotes (unfortunately, at the LLM's discretion, but better than nothing). The law is their greatest enemy.
No, nobody should. Don't know why any trust the gossip slop bots given the extra work required. Pretty brave to trust another word guessing bot when its unable to stop making stuff up. Might want to ask it about Smith-Mundt 2012. Bill says discern, a lot, for a reason.
...as markdown please.
Dumb question: why do news websites have such a hard time keeping users logged in? Like I can go an entire year without getting logged out of gmail. But can't go more than a few days before getting logged out of news websites.
I have subscribed to news sites and still use something like archive.is because it is faster than my paid experience.
The government can take down huge criminal networks on the darkweb but can't identify the owner of a clearnet site?
Since you refer to the darkweb. The gov has extensivley studied Tor and likely has zero day exploits for the Tor browser and operates a bunch of Tor relays. Given enough time and effort it is very much possible for state actors to identify Tor users.
But unless you are a high profile gov target, Tor protects you well.
> unless you are a high profile gov target, Tor protects you well.
How do you really know that? I understand the theory, but do you have evidence? Have you tested it or read research that has tested it?
I would hesitate to give advice to people when they could get hurt.
> do you have evidence
Well it is of course not possible to 100% prove that Tor protects your privacy.
But the lack of evidence is also evidence. While we have evidence that the gov was able to identify indiviuals using Tor e.g. hosting drug portals, there has been no reports that individuals or companies are able to de-anonymize Tor users.
Tor was created by the US Navy.
And?
It’s intriguing trivia for people of all persuasions.
Government can exploit Firefox, more at 11.
That owner is not so simple - I recall how they alleged in a Wikipedia discussion he(?) used some botnet or proxy network for adding archive.is mirror links to Wiki entries: https://en.wikipedia.org/wiki/Wikipedia:Requests_for_comment...
They can and they will. Filing a subpoena for information is a step in that process.
If the WHOIS records are falsified they'll start looking at payment information.
Who said they can't?
We're pardoning fraudsters left and right who bribe the president.
But archive.is ... that people use to read and be informed about the world around them, better get that guy.
The US gov isn't a monolith. We don't know where the pressure came from or when the investigation started.
IMO this is more an indictment of the president being able to pardon (any president, not just current one).
IMO the president should, at best, be an additional appeals round. (But probably just not involved in the Judicial because separation of powers is good)
Exactly. Now that the facade of being a country with integrity and equality is thoroughly shattered, good luck getting public support for shutting down a website that lets people read news for free. Shit's on fire yo; we got bigger problems than that.
They didn't pay the bribe or tickle the diapered royal pink starfish properly. The Just Us system operates on the principle of favoritism with selective privilege/retribution rather than consistent fairness. They're perfectly fine having the DNI being a Russian mole and 47 rolling out the red carpet for a sanctioned war criminal.
In this day and age, MAANG, lacking integrity and values, bet on flattery and bribery as business expenses to ensure favorable treats instead of being punished.
While strangely unpopular here, Yasha Levine's[0] well documented premise is that the entire existence of the internet is designed for surveillance and content control, down to the chip level, and this is mandated and enforced through laws as well as more covert agreements.
[0] https://www.amazon.com/Surveillance-Valley-Military-History-...
It's strangely unpopular because it's wrong in the one place techies care about: the details.
In broad strokes, it's true to say that the Internet was created as a surveillance and control tool. But this was not a big design up front with those goals as built-in capabilities. There's nothing in TCP/IP you can point to and say, "yes, this is the surveillance bit", or "yes, this is the government control bit". "Down to the chip level" is just plain wrong. Yes, you could argue that the Internet was enabling those things, but that's true of all communications technology, if not just the basic concept of human socialization[0].
And, in practice, if the US had actually intended for the Internet to be a surveillance and control tool, it was sure as shit really fucking bad at making use of it. The only country that actually realized it needed to censor the Internet to maintain cultural/social hegemony was China, which is why they got into network censorship early. By the time America realized it wanted that level of control it had to outsource the wetwork to creative industry and advertising companies.
[0] Most neurotypical people fail to recognize this.
Techies really like to believe that they are building a bright future for humanity. Telling them that what they build is a high-tech concentration camp won't be received well.
"It is difficult to get a man to understand something, when his salary depends on his not understanding it."
-Upton Sinclair
Related HN discussion from 2023: https://news.ycombinator.com/item?id=37009598
Never knew archive.is was run by a "masked man"
Looks like `archive.is` is currently using reCaptcha. So Google might be able to figure out and tell the FBI who runs it. (If not by data around the registration, then by data around accesses to the site that seem to be by a developer of it, coupled with their cross-site tracking data.)
I've also seen Cloudflare similarly in the loop, and they have similar cross-site tracking data.
Lesson: The same third-party tech surveillance companies to which you sell out all your visitors, can also violate you.
Coincidentally, their adoption of Google CAPTCHA (along with requiring javascript) is why I stopped using archive.today. I don't particularly want either of those entities executing mystery code in my browser, or on my computer at all.
Helping Google to collect records of my reading habits is also unappealing.
I notice the iamadamdev paywall bypasser extension was also taken down with a DMCA request.
Mirror https://github.com/nikolqyy/bypass-paywalls-chrome
Let's build and share more and better tools to help ensure poor kids are allowed to learn.
Information, knowledge, and education do not belong only to those with money.
Development is continuing here, the link you gave is just an old mirror:
https://gitflic.ru/project/magnolia1234/bypass-paywalls-fire...
Thanks, wikipedia has more info and good links to related stuff:
https://en.wikipedia.org/wiki/Bypass_Paywalls_Clean
see also esp:
this is a waste tax payer funds.
This might not be about copyright. I generally avoid these mirror sites because they seem like the perfect opportunity for watering hole attacks. The challenge with a normal watering hole attack is that you have to control the site in question either by hacking or infiltrating it. Imagine however if you were able to act as a middle man to the most popular websites in the world, and people would voluntarily post links to your site all over the internet, including very valuable audiences (like HN). You would have free rein to selectively inject malware to just readers at targeted IP blocks, minimizing chances of detection because most users would never be served malware. The possibilities are endless, government espionage, corporate espionage, activists, political opponents.
To be clear I have no reason to believe specific instances of these sites are malicious, but I would be shocked if black hats weren't trying to get into this space in general.
For sure, you shouldn't just trust whatever random mirroring site pops up (in fact, you probably should trust almost none of them), but archive.is has established themselves pretty credibly IMHO. At some point it could turn, but I don't think we should kill them now just in case they turn at some point.
The fact that the FBI is involved, and given the insane amount of IP protection racket stuff going on, I think it's pretty highly likely this is all about copyright. I think the powerful interests care more about copyright than they do about most other things.
Maybe, but the subpoena doesn't shed light on what they are being investigated for. It is only demanding information.
The FBI could be investigating them for archive.today, they could be investigated because of that apparent botnet, they could be investigating them because some billionaire media mogul friend of the current POTUS is outraged at the loss of revenue. To the best of my knowledge, the reasons aren't public.
Still, it doesn't mean we shouldn't be asking questions or expressing concern over this.
News aggregator The Drudge Report recently started using archive.is links to articles. That might have angered some publishers.
Without the popups, care of the target: https://archive.ph/FEcEi
Someone at archive needs to prepare a torrent file right away
I doubt this has anything to do with copyright law. I'm certain it has everything to do with certain things needing memoryholing and archive.* operators' lack of compliance.
I will be devastated if this site gets taken down. I subscribe to pinboard.in for personal website bookmarking but even that is not 100% guaranteed to successfully cache a copy of the page.
>According to this, Archive.today uses a botnet with changing IP addresses to circumvent anti-scraping measures.
Archive.today uses Tor exit nodes when all of its main server IPs are blocked, so I believe this to be a disingenuous claim.
And this very news site's settings are "Data processing by advertising providers including personalised advertising with profiling - Consent required for free use," funnily enough
ironic that archive.is itself has so many bot protections...!
> Another private investigation from 2024 comes to a different conclusion. It names a software developer from New York as the alleged operator. According to this investigation, following the trail to Eastern Europe proved to be a red herring.
Any pointers to what this "private investigation" is? The other linked blog pointing to Russia (or at least a Russian) seems pretty convincing:
https://gyrovague.com/2023/08/05/archive-today-on-the-trail-...
> There are also indications that the operator(s) are based in Russia.
That's long been my assumption.
What I haven't known was whether this was good Russian people (culturally valuing literature and intellect) wanting to be able to access articles that they can't afford.
Nor whether it was or could become something sketchier (e.g., feeding spy databases, or one nice Chrome zero-day and strategic timing away from compromising engineering workstations at most US tech companies where an employee reads HN).
But what actually bothers me about the misc `archive.*` sites is how HN routinely uses them, for US tech company workers to circumvent paywalls for struggling journalism organizations. This piracy practice seems to have the unofficial blessing of the US tech investor firm that runs and moderates HN. Besides whatever laws this is breaking, subjectively, it feels to me like crossing an ethical line, and also (economically) like punching down.
I only very rarely read articles from these sites, and on these occasions I tend to get a lot of value out of it. I’d be happy to pay a dollar (or a couple bucks) per article. But I tend to forget about my subscriptions, and don’t read enough articles to justify it, so I’ll be wasting a lot of money buying it.
The problem with the paywalls is that everyone offers a subscription. If you want to read a single article you do not want to subscribe to some US newspaper.
x402 solves this.
X402 doesn’t solve this, because the publishers don’t want to sell you just a single article.
Just because they don't 'want' to, doesn't mean it's not a good solution. Clearly they're not getting me to pay for their whole site subscription so why not just sell me the article? The biggest issue with most of these services is the lack of consumer 'ease' by which the creator can actually get paid. It's the same reason why I'm seeing all my friends go back to piracy, it was nice when we had a consumer convenient place to consume our content. Just look at Steam, it's easier than ever to go pirate the majority of games, but my Steam library just keeps getting bigger. I'm not against buying things, I'm against shitty services.
I can't argue with your ethical concerns.
The internet is fundamentally different than print though—perhaps this fundamental change to journalism requires another way to pay the bills. (Advertising is the obvious one.)
Or maybe we, as a society (because of our internet ways) simply don't deserve these services any longer.
Perhaps the internet itself is the problem. What if instead that was the big mistake after all?
Called passion. Pays in spades. Something bots know not of. Includes carbon bots.
FBI could just create Great America Wall and block it, what's the problem?
Archive everything you can in Chinese, while you still can, if you think the folks over there may have posted anything worth you reading…
The FBI is targeting Archive.is, a botnet-powered archive that has operated openly for years. What a fascinating mix of tech and legal exposure.
They are probably just using proxies to scrape from and not directly or knowingly using proxies supplied by botnets.
Knowingly or not, they're still an accomplice. They're basically the getaway driver for cached webpages. /s
Wait why should we have a problem with this?
We don’t have a problem. That was sarcasm.
What is that FBI wants to hide but not making it public and why?
We can't lose that site. Hacker news can't exist without it in this day and age of paywalls.
Btw their leadership names are listed on their website
Wait is archive.is a bad website to go to?
Using FBI activity as a proxy for "good" or "bad" associations is folly.
"Infamous"? About as infamous as heise.de. Weird framing. Many people do not like the past being available for reference when they lie about in the future. And that's what this federal attack stems from.
"who controls the past controls the future: who controls the present controls the past"
I just posted an exerpt of the 404 article but decided to link the source.
My thoughts exactly. This title is needlessly editorializing.
Why on earth did you get downvoted?
The FBI is attempting to unmask the owner behind archive.today, a popular archiving site that is also regularly used to bypass paywalls on the internet and to avoid sending traffic to the original publishers of web content, according to a subpoena posted by the website. The FBI subpoena says it is part of a criminal investigation, though it does not provide any details about what alleged crime is being investigated. Archive.today is also popularly known by several of its mirrors, including archive.is and archive.ph.
The subpoena, which was posted on X by archive.today on October 30, was sent by the FBI to Tucows, a popular Canadian domain registrar. It demands that Tucows give the FBI the “customer or subscriber name, address of service, and billing address” and other information about the “customer behind archive.today.”
“THE INFORMATION SOUGHT THROUGH THIS SUBPOENA RELATES TO A FEDERAL CRIMINAL INVESTIGATION BEING CONDUCTED BY THE FBI,” the subpoena says. “YOUR COMPANY IS REQUIRED TO FURNISH THIS INFORMATION. YOU ARE REQUESTED NOT TO DISCLOSE THE EXISTENCE OF THIS SUBPOENA INDEFINITELY AS ANY SUCH DISCLOSURE COULD INTERFERE WITH AN ONGOING INVESTIGATION AND ENFORCEMENT OF THE LAW.”
The subpoena also requests “Local and long distance telephone connection records (examples include: incoming and outgoing calls, push-to-talk, and SMS/MMS connection records); Means and source of payment (including any credit card or bank account number); Records of session times and duration for Internet connectivity; Telephone or Instrument number (including IMEI, IMSI, UFMI, and ESN) and/or other customer/subscriber number(s) used to identify customer/subscriber, including any temporarily assigned network address (including Internet Protocol addresses); Types of service used (e.g. push-to-talk, text, three-way calling, email services, cloud computing, gaming services, etc.)”
-snip-
Read more: https://www.404media.co/fbi-tries-to-unmask-owner-of-infamou...
> YOU ARE REQUESTED NOT TO DISCLOSE THE EXISTENCE OF THIS SUBPOENA INDEFINITELY AS ANY SUCH DISCLOSURE COULD INTERFERE WITH AN ONGOING INVESTIGATION AND ENFORCEMENT OF THE LAW.
Is this actually a mere request, as in the receiver is _not_ required to avoid disclosure?
Separately—can't believe tucows is still around!
Isn't the whole thing a request? The FBI has no power in Canada unless they go through Canadian legal channels, no? If I received a subpoena from a foreign sovereign I would just use it as toilet paper.
Pretty sure if Tucows had been in the US, the "request" would have been a gag order or something alike.
Until you travel to the US 10 years later for someone's wedding or what not completely forgetting about the matter and you end up arrested
They probably cannot require this. They may be able to get you on interfering with their investigation if you disclosed with the intent of interfering. Probably adding this notice helps them prove you were aware of the potential to cause interference, at least. IANAL.
Well Tucows is Canadian so the FBI can take their “request” somewhere else.
As a Canadian, I really hope Tucows is going to send a particularly nasty response to the FBI. Canada should never collaborate with any US authorities!
> Canada should never collaborate with any US authorities!
Cross-border collaboration is a good thing. Our agencies regularly collaborate to bring people who feel insulted and emboldened to account for their crimes. This works both ways.
As someone who has dealt with media of me as a minor (~around 11/yo) from Omegle being shared across the internet, the role archivers play in keeping illegal content “alive” isn’t well recognized. Thankfully, the Internet Archive has a matured process to purge pages that host illegal content.
We do not know what the investigation is for. All is up to speculation. Not all investigations are bad.
Here is an example on archive.is. I submitted multiple complaints to NCMEC but didn’t get results. Germany, though, was able to get the archives purged.
https://archive.is/https://ezgif.com/maker/*
On the page, you will see:
> In response to a request we received from 'jugendschutz.net' the page is not currently available.
That page held many, many images of minors. It is good that it is gone.
August 12, 2025 - Canadian Man Sentenced to 188 Months for Attempted Online Enticement of a Minor and Possessing Child Pornography [1]
August 21, 2024 - Canadian National Extradited To The United States Pleads Guilty To Production Of Child Sex Abuse Material And Enticement Of Minors
December 20, 2024 - Extradited Canadian National Sentenced To Life In Federal Prison for producing child sexual abuse material and enticement of a minor [3]
[1] https://www.justice.gov/usao-ndny/pr/canadian-man-sentenced-...
[2] https://www.justice.gov/usao-mdfl/pr/canadian-national-extra...
[3] https://www.justice.gov/usao-mdfl/pr/extradited-canadian-nat...
> Cross-border collaboration is a good thing
IMO it's only a good thing when it's a good thing. There are plenty of reasons it could be a bad thing too. For example, Edward Snowden probably would have been hung by now if russia cross-border collaborated.
The term that Cory Doctorow has called these styles of stunts is:
Felony contempt of business model.
Turns out, our very user, Saurik, came up with this term!
https://pluralistic.net/2022/10/23/how-to-fix-cars-by-breaki...
I'm concerned that, even here on HN, we are underestimating both the magnitude of this looming conflict and also its inevitable conclusion.
The internet is bigger than you and me, and it's bigger than computers. It is an evolutionary force. It is not going to be stopped, and certainly not by states whose popularity and authority are waning so rapidly.
Moreover, the internet keeps as its core function the proclivity to copy and store bytes, and from this very simple mechanism emerges a large set of tools and norms that supplant nearly all of the ways that nation states perpetuate power.
What we desperately need are the elder statesmen and women to stand up and soberly see the writing on the wall, and gracefully deprecate the systems of which they will soon lose control, starting with nuclear weapons.
I don't believe this has to end in violence or acrimony of any kind. But we have run out of time to act like petulant children, crying that somebody took our empire away.
Very soon (ie, in the next couple centuries, maybe sooner), a few small nation states will adopt frameworks of zero IP, allowing all the content of the internet to be housed there, and from there, accessed by the entire planet.
Some other nation states may attempt some kind of embargo or sanctions, but these will obviously fail, just as the attempts by Russia and China to censor the internet within their borders have failed (and are failing with greater volume with each passing day). And before you cry that adoption in China is too low to support this conclusion, consider that the work to resist the GFW has begat some of the best networking tools in the world, with rapidity of evolution increasing, not decreasing. Even if fear and violence can stem adoption for a few decades or even centuries, the toolchain continues to grow and will tip the scales over sufficiently long time scales.
Let's not let this become a world information war. Let's install peace now while it's still easy to do. Let's dispense with IP and live in a world of joyous open access to information.
One can only imagine the sharing and reading histories the operator has accumulated on the people using it. No restrictions on how the operator can use that data
Archive.today is very popular with HN commenters
What histories? Does archive.is take your email, phone, credit card, and passport pic when you want to read anything? The most there is is just an IP address in the server logs, for most users, rotated by their ISP on regular basis, easily obscured with a VPN.
This need to make IP-infringement sound ominous by invoking some ill-defined spy plot is a tired cliche.
There are many mechanisms, widely used, to aggregate information from many sources into a profile of you, and using your IP as an identifier isn't hard. Many lawsuits find their targets based on IP addresses, for example.
> easily obscured with a VPN
I think we can expect that commercial VPNs are compromised, at least by intelligence services. Imagine you opened a bar and advertised, 'dissidents come here to drink in privacy'. I'm sure you'd attract others too to an obviously target-rich environment.
When the proxy operator is (1) anonymous, (2) the potential target of coercion, and (3) outside the user's jurisdiction, how does a proxy user verify (a) whether the operator collects, or is being forced to collect, data or (b) what it might do, or be forced to do, with colllected data
Perhaps the answer is they don't verify
Definition of "histories"
Records of every URL submitted and accessed by a given IP address+browser fingerprint^1
1. Archive.today sites use a form of "pixel tracking" to collect IP addresses via popular graphical browsers, the ones that are required to be used to solve CAPTCHAs, that automatically request URLs in HTML tags with the "src" attribute, e.g., "img" tags
2. Archive.today sites serve CAPTCHAs to some users^3 which force them to enable Javascript and share browser information
3. For example, those users not using or appearing to use popular graphical browsers
They don't collect any personal information - one of the reasons why it is so popular.
How do you know? One’s IP address counts as PII.
How do you know?
Uh I hope for the best, some websites opted-out of archive.org, so archive.is is my alternative.
While we're here how does archive.today bypass paywalls?
>What scraper or headless browser are you using? it works so well.
>Before 2019 - PhantomJS, after - ordinary (not headless) Chromium/80 with few small patches.
https://blog.archive.today/post/618635148292964352/what-scra... (2020)
>Archive.today launches real browsers (not even headless) and tries to load lazy images, unroll folded content, login into accounts if prompted with login form, remove “subscribe our maillist” modals
https://blog.archive.today/post/642952252228812800/people-of...
I get that it convincingly simulates a human but so do I (because I am a human) and I don't get through the paywall...
There are some tricks which work for different websites - for example, for NYT it's enough to manually clear nytimes.com cookies, FT used to work after click from twitter/x and so on. So I guess there is some set of heuristics.
It seems that archive.is often has the full article for sites that are completely paywalled to every non-paying visitor: no cookie-driven freebies, nothing.
Publicly revealing everything they are doing would be a strategically bad idea, obviously.
It's not inconceivable that they actually pay for access to some of the sites; it wouldn't be surprising.
They are not actually bypassing firewalls - therefore I think they are on ethically good grounds. Those sites show their full text for web crawlers - only not to humans. Basically, archive.is and the folk simulate that through various means. Headless browsers, better agent injection etc.
I don't think that's true. If it was that simple, there would be browser plugins or other apps that would replicate that behavior. Do you know of any?
Perhaps not in the same way as described above, but BPC exists.
I thought it just sees a full version for crawlers?
Nope, see r721's comment above yours for how it purportedly works.
Can they enforce DNS companies (ISP, cloudflare etc) to block these domains globally if they want to?
Cloudflare's DNS actually hasn't worked with archive.today for >5 years, due to the site returning bad results in response to Cloudflare not sending EDNS subnet info. HN comment from someone at Cloudflare: https://news.ycombinator.com/item?id=19828702
> Archive.is’s authoritative DNS servers return bad results to 1.1.1.1 when we query them. I’ve proposed we just fix it on our end but our team, quite rightly, said that too would violate the integrity of DNS and the privacy and security promises we made to our users when we launched the service.
> The archive.is owner has explained that he returns bad results to us because we don’t pass along the EDNS subnet information. This information leaks information about a requester’s IP and, in turn, sacrifices the privacy of users. This is especially problematic as we work to encrypt more DNS traffic since the request from Resolver to Authoritative DNS is typically unencrypted. We’re aware of real world examples where nationstate actors have monitored EDNS subnet information to track individuals, which was part of the motivation for the privacy and security policies of 1.1.1.1.
This was fixed/changed at some point. I use Cloudflare's DNS and it works fine for me.
This is the way to defend the free spirit of the internet.
Nothing infamous about it. It's the only way to stay informed from diverse sources since proliferation of paywalls started
And if no one pays for any of that content there will be zero ways to stay informed!
Information has existed long before the invention of money, and will outlive it.
IMO it is a false dichotomy to equate piracy with theft like that.
I think 90-99% of anything pirated (or accessed by bypassing paywalls etc) would just never be bought/paid-for if there was no alternative.
Are there no ads on the site?
Isn't paying to remove paywalls another way?
If you are OK with getting your information from one or two sources - why not. You can also subscribe to a newspaper. But surely internet can do (and did until relatively recently!) better than that.
Paywall epidemic is a recent phenomenon, internet media managed to exist before that.
Imagine tech journalists in 2025 not knowing what a canary is...
That is very funny. "AI" corporations are funding a scraper to subvert paywalls:
https://news.ycombinator.com/item?id=45835090
The FBI should investigate the "AI" companies and also the demise of Suchir Balaji, a copyright whistleblower who according to a sloppy local police investigation committed "suicide" hours after being seen cheerfully collecting a doordash delivery on CCTV.
AI provides shareholder value, while archive.is reduces shareholder value, and that's all that matters
I don't think it's even that abstract anymore. The AI companies donated to help Trump build a ballroom. archive.is didn't.
I'm so confused by this power dynamics. So you can torrent movies just fine from Meta/Google office in Germany? No matter how you look at it, the White House has holes in the walls as in Idiocracy.
Copyright lobbyists and sport broadcasters, the ultimate overlords of the web.
I thought this was common knowledge? Did they try googling it?
The thought to google it doesn't really have a chance to enter their head if they don't know about it.
Maybe they ran into a paywall
A certain country is trying to scrub the internet of evidence for its war crimes.
It could be for something far more petty, like covering up speaking gaffes. When you give petty image-obsessed people a lot of power, they’ll use it for petty, image-preserving reasons.