Comments Page - Show HN: A tool to analyze Hacker News sentiment on any term in seconds

« Back Show HN: A tool to analyze Hacker News sentiment on any term in secondsclassysoftware.ioSubmitted by lorddustingale 9 months ago

wongarsu 9 months ago
If anything, this tool tracks with my general opinion on sentiment analysis: it would be awesome if it actually worked, but most algorithms just predict everything as neutral.
For example if you search for bitwarden it ranks three comments as negative, all others as neutral. If I as a human look at actual comments about bitwarden [1] there are lots of comments about people using it and recommending it. As a human I would rate the sentiment as very positive, with some "negative" comments in between (that are really about specific situations where it's the wrong tool).
I've had some success using LLMs for sentiment analysis. An LLM can understand context and determine that in the given context "Bitwarden is the answer" is a glowing recommendation, not a neutral statement. But doing sentiment analysis that way eats a lot of resources, so I can't fault this tool for going with the more established approach that is incapable of making that leap.
1: https://hn.algolia.com/?dateRange=pastMonth&page=0&prefix=tr...
- team-o 9 months ago
  I haven't looked in the specific classifications of this particular model, but what your comment shows is the importance (IMO) of having a "no sentiment" class when classifying sentiment. E.g. if someone says "John doe is an average guy", the sentiment to John is neutral. But if someone says "John doe is my uncle" there's no sentiment and it should be classified as that. Perhaps the classifier here already takes this into account, but just thought it was worth mentioning the importance of having this extra class, or a separate pre-filter classifier. In your example I also see many that could be filtered out. E.g. "I store them in Bitwarden not in dotfiles" doesn't contain negative/neutral/positive sentiment, or at least you're not able to tell from just this sentence. I appreciate it's a fine line between neutral and no sentiment though.
  stephantul 9 months ago
  There’s some old work [1] that conceptualized sentiment as an interplay between subjectivity and sentiment. The more subjective a statement, the more “range” sentiment gets. I think this is what you are getting at.
  I don’t think it ever gained traction, probably because people aren’t interested in creating an actual theory of sentiment that matches the real world.
  [1]: https://github.com/clips/pattern/wiki/pattern-en#sentiment
  dataflow 9 months ago
  > E.g. "I store them in Bitwarden not in dotfiles" doesn't contain negative/neutral/positive sentiment, or at least you're not able to tell from just this sentence.
  That's an interesting example because when I read it it sounds to me like something slightly positive, or at least, unlikely to be negative. Because if you had a negative opinion of Bitwarden, you probably wouldn't be storing stuff in it.
- Mockapapella 9 months ago
  I think this is fair criticism for where it's at and mirrors my experience while building the tool. For generative AI at least, the smartest models + a good prompt will waffle stomp our tool in terms of quality.
  For example, while testing it on "Founder Mode" there were a couple comments that mentioned something like "I hate founder mode but I really really like this other thing that is the opposite of founder mode..." and then just continues for a couple paragraphs. It classified the comment as positive. While _technically_ true, that wasn't quite the intention.
  We think there are some ways around this that can increase the fidelity of these models that won't involve using generative AI. Like you said, doing it that way eats a ton of resources.
  wongarsu 9 months ago
  Just spitballing, but maybe a good tradeoff is to use NLP to find good candidate comments that are likely to contain a sentiment, and then analysing a small number of them with a more expensive model (say a quantized 3B or 7B LLM with a good prompt). The quality over quantity approach
  drc500free 9 months ago
  Almost 10 years ago, I ran a media sentiment analytics product that worked on hand coding. Everything automated that we tested - and that our competitors tried to launch - output garbage. The status quo for media analysis was either averaging a lot of garbage, or paying an insane amount for a very limited scope. We used automated tools to multiply the hand coders, but there was no plausible lights-out solution if accuracy really mattered.
  That's completely changed in the last 18 months. All my colleagues in the industry have switched to LLMs. They're seeing accuracy as good as hand coding was getting (these were generally college educated coders), at scale.
  Non-LLM sentiment tools were always a bit of a parlor trick that required cherry picking to survive the demo. In almost every case drilling to the actual statements revealed it was wrong on anything involving irony, humor, or even complex grammar. That's changed almost overnight.
  I think the finding that hn is "neutral" about MBAs says all that's needed about accuracy here.
  zzleeper 9 months ago
  BTW, which algo did you use to classify sentiment? bert or something related?
- undefined 9 months ago
  [deleted]
codetrotter 9 months ago
I think this is actually one of the very first times I have seen neumorphic design in the wild.
Prior to this I’ve mostly only seen it on dribbble.
I actually like this style a lot, and I wish more apps would use it. But at this point I thought that this style was one that “came and went” before it saw any significant actual use in any apps or OSes. Maybe there is still hope after all :)
Edit: oh and I had to try asking your tool for sentiment about neumorphic design after this of course. It returned my own comment lol :p and it called it “neutral”. Is it only evaluating the first paragraph that the word appears in in the comment? (Also I guess other people more commonly refer to it as “neumorphism” than as “neumorphic design” and maybe that’s why when I asked it for neumorphic design it returned my own comment.)
- wongarsu 9 months ago
  To me, neumorphic design looks like skeumorphism for those cheap interfaces made of one continuous sheet of thin printed (sometimes vacuformed) plastic with the actual buttons hidden underneath. Stuff like [1] or [2]. And while I love skeumorphism, I hate those kinds of plastic interfaces. They always use the cheapest mushy buttons, and the pressable area of the buttons is always much smaller than what the printed buttons suggest. It's probably the one kind of physical interface I hate with a passion, and imitating it digitally doesn't evoke any positive feelings in me.
  Still better than making everything flat without shadows and making me guess where I can click, I guess.
  1: https://www.alamy.com/stock-photo-finger-pressing-the-button...
  2: https://www.alamy.com/close-up-of-clothes-washing-machine-bu...
  Edit: just checked, this comment was analyzed as "Sentiment: neutral (Confidence: 79.56%)" on the topic of "neumorphic"
  4ggr0 9 months ago
  > I hate with a passion, and imitating it digitally doesn't evoke any positive feelings in me.
  > this comment was analyzed as "Sentiment: neutral (Confidence: 79.56%)"
  I wonder what kinds of heinous things you'd have to write for it to be negative...
  Almost comical that this comment is not analyzed as negative.
- Mockapapella 9 months ago
  (the other part of the team that built this here)
  Honestly makes me pretty happy you called out the theme. I've always enjoyed this style of design and was sad to see that it never picked up steam. I love how it seems to combine a digital Material design with a more physical and real feeling. I'm doing my part to bring it back.
  The tool definitely has some kinks in it that we have plans to iron out over time; we just wanted to get it in front of people to see if anybody would even like it. Right now it's just grabbing the first 256 tokens and categorizing on that, and it grabs the first 5000 comments (split over 5 calls) over the past month.
  ranger_danger 9 months ago
  It's uncommon for several reasons: it's not very accessible, its predecessor skeuomorphism (like Win95 style interfaces) was overdone for decades, and it ignores all the reasons we transitioned to flat design in the first place.
  https://www.nngroup.com/articles/skeuomorphism/
  https://www.nngroup.com/articles/flat-design/
  >Neumorphism never quite made it mainstream because it comes with its own set of problems. The low contrast does not offer sufficient visual weight, making the experience not accessible. Additionally, it is difficult to determine clickability, as neumorphism is often used inconsistently on nonclickable and clickable elements.
  Don't get me wrong, I still like the design and I think it's cool, but I understand the reasons why it never got popular.
  Mockapapella 9 months ago
  Yeah I've heard about the accessibility argument. I tried to mitigate that somewhat with the button designs having a black/while outline (depending on the theme), but I may need/want to add a "high contrast" mode later on.
  WantonQuantum 9 months ago
  An aspect of the UI I really like is the way previous results stay available while doing new ones.
  chucksmash 9 months ago
  Tokens are characters in this context?
  I tried "neumorphic design" based on the comment this replies to. It is classified as neutral.
  Mockapapella 9 months ago
  Tokens are approximately 3/4ths of a word. That might sound odd if you're unfamiliar, but in short it has to do with how LLMs break down words to process them. Letters are too granular for sufficient quality, full words don't allow LLMs to produce novel text like "dkjmcf0248375cyu18c7437tr18c237m yt034c8nwey4mtc0r23q9p,a;" (should it need to), so the industry has settled on the middle ground of a part of a word.
  Unearned5161 9 months ago
  yeah, looks epic. Makes me think of Superman's spaceship in Man Of Steel , mega clean
- Pannoniae 9 months ago
  (tangential) I wish actual skeuomorphism would also come back....Even in things you can theme, it's just choosing between different flat looks.
- tdeck 9 months ago
  Cool to know that this has a name! For some reason it made me think of some old text character based interfaces like this that had recessed gray form fields:
  https://www.cloudwisp.com/exploring-visual-basic-1-0-for-ms-...
- philjackson 9 months ago
  There was a period of time where the Mozilla Suite had this design style. Not sure I liked it much, but the nostalgia is nice.
- SuperHeavy256 9 months ago
  It looks kinda cheap imo.
bhaney 9 months ago
I like the concept and the interface, but after trying one of the suggested examples, I'm not convinced this actually works?
I tried "remote work" like the initial instructions recommended as an example. The graph it gave me showed large spikes of "neutral" sentiment with a few negligible bouts of negative sentiment and even smaller bouts of positive sentiment. The sample comment it gave was from a "Who Wants to be Hired" post where the poster demanded exclusively remote offers, which the tool classified as "neutral" (with 98.7% confidence!)
Very slick tool, but if the sentiment analysis itself doesn't really work well then I don't see what value this could have.
- lorddustingale 9 months ago
  It will definitely work better with larger datasets, we have plans to improve quality.
solardev 9 months ago
Hmm, it thinks HN is neutral on crypto. Hmmmm.
Is it actually doing anything?
- __MatrixMan__ 9 months ago
  Apparently we prefer death over Oracle. But on a second search Oracle is then better than death...
  No, I don't think it is.
  philipwhiuk 9 months ago
  I guess it's an issue when we dislike two things just as much :P
- undefined 9 months ago
  [deleted]
ranger_danger 9 months ago
My biggest issue when I tried to analyze sentiments was that there is no practical way to bias for sarcasm, such as "it's great if you love pulling your hair out".
stevage 9 months ago
A month doesn't seem like a useful time frame? Or maybe I'm misunderstanding the main use case.
Also, you could drastically improve styling on mobile. Lot of wasted space.
- Mockapapella 9 months ago
  Main case right now is to see if people like it (and so that we don't hit Algolia rate limits). You're right, a more useful case might be letting the user customize the time frame.
  As for mobile, I am but a humble backend dev, but I agree completely. Will put it on the roadmap. Thank you for your feedback!
Nuzzerino 9 months ago
Seems pointless to go through all that effort to chart this over time, and have that time limited to..... a single month.
> you can track trends with it
No, no, you can't.
IgorPartola 9 months ago
It would be really interesting to see, in the limit of this tech, how any given comment ranks on the group-think to contrarian scale for every comment you read. There are a lot of signals on HN about comment quality, like karma, account age, reputation of certain individual names, but this could be an interesting one. Wonder what kind of emergent behavior it would drive.
avodonosov 9 months ago
Idea: single number metric, something like, %-of-positive - %-of-negative avearaged over a time period.
So that we could compare terms based on this result metric: google vs microsoft, rust vs go, rust vs microsoft, etc
(Will not work for Go as it's a common word in addition to the programming language, but anyways)
- Mockapapella 9 months ago
  I like it. We'll put it on the roadmap.
xnorswap 9 months ago
Please link to the "Randomly sampled comment", there's an ellipsis but no way to actually read the comment?
I searched "apache" and the "randomly sampled comment" was a mystery:
> I think the author has a point with one-way doors slowing down the adoption of distributed systems. ...
I had to search google with that phrase to get the actual context. ( https://news.ycombinator.com/item?id=41363836 )
Which turned out to be about "Apache Beam" not the Http server.
trampish 9 months ago
Instead of "linking" to a data:image URL, consider using a <map> and several <areas> to link to posts from the given time range like so: https://hn.algolia.com/?dateEnd=1726171200&dateRange=custom&...
breck 9 months ago
Love the fact I didn't need to signup to get value out of this!
Note: I would suggest just removing dark mode for now. Works WAY better in light mode. I almost missed the light mode, and that would have been too bad.
Here's my user test: https://news.pub/?try=https://www.youtube.com/embed/2eac5XZe...
- Mockapapella 9 months ago
  Oh wow, was not expecting this level of feedback -- thank you for the video!
  Addressing some points you made in the video:
  1. Yes, this is definitely a V0.1. Thank you for taking that into account :D
  2. Interactive graphs (custom colors, intensity, what trends to show, etc.) were something we thought might be useful but wanted to hear what others had to say before we invest the time into a feature nobody wanted
  3. That super slow "Searching..." loading message should be showing updates for when it's scraping comments, when it's classifying, etc. Not sure why it's not. Some searches like "open source" are more likely to have a ton of comments to pull so it'll take longer.
  4. Default mode changed to light mode because it looks better :^)
SuperHeavy256 9 months ago
Would be great if you added buttons like 'Show Positive Sentiment Example' 'Show Neutral Sentiment Example', and 'Show Negative Sentiment Example' to the chart that will show a randomly picked comment from Hacker News with one of those sentiments. It would really help in understand the source of the information and it would really prove that your info is genuine.
avodonosov 9 months ago
Good tool to choose techologies. Java and Rust have mixed sentiment. For Brainfuck the positive coments clearly outweigh the negative ones.
nomilk 9 months ago
It’s great. Simple and works in 10 seconds. Only thing is I’d prefer to know sentiment over at least a year (defaults to just 1 month)
Still, great idea and execution
- Mockapapella 9 months ago
  Thank you! We plan on letting users select their own timeline later -- this was more of a defense for us to make sure we didn't get rate limited and everyone could try it out
- undefined 9 months ago
  [deleted]
rustdeveloper 9 months ago
Surprisingly, according to you tool, HN is neutral on “web scraping”. I noticed others also reported bias for neutral on other keywords.
eth0up 9 months ago
Exit vim - Sentiment: neutral (Confidence: 98.32%)
Stockholm Syndrome?
MeetingsBrowser 9 months ago
On mobile when I select the text input the whole page zooms so far in that I can’t see the submit button (or anything else for that matter).
Took me a minute to figure out what had happened, but I was able to submit a phrase. The response was
> Randomly sampled comment from the pulled data: Sorry, there was an error processing your message.
- undefined 9 months ago
  [deleted]
op00to 9 months ago
The sentiment of “fart” is neutral.
luke-stanley 9 months ago
`Sorry, there was an error processing your message.` It got neutral for stuff I wasn't expecting to be neutral, it shows a frequency graph but sentiment range / variability might be worth showing.
t0bia_s 9 months ago
Please, don't be funky with UI. Make it simple, without emboss design.
- madarco 9 months ago
  I literally spent more time looking at the UI than playing with the tool itself, I really love this skeuomorphism!
  If you remove the dark-gray border from the buttons it's even better!
- velcrovan 9 months ago
  For something you depend on and use a lot, I agree. For what this is, the creative UI is arguably the best part.
wslh 9 months ago
I searched for 'crypto' and noticed that most sentiments appear neutral, whereas I would expect negative sentiments to be higher. It might be an interesting case for you to test on your side.
bagels 9 months ago
Are there any terms that don't say "neutral" as the sentiment?
- solardev 9 months ago
  Found one...!
  Coffee. We apparently hate "coffee" with a 79.99% confidence. Unless you ask it again, in which case we like it with a 98.67% confidence. And if you ask it again, it's 98.86% certain that we're neutral on the topic.
  Same with "spam". Sometimes we like it, sometimes we don't. I guess it's bad in emails and good on musubis? Shrug.
- max-m 9 months ago
  "dang" is liked by HN! :D
- fortran77 9 months ago
  I tried “your mother” and it went into the positive area twice, but mostly neutral.
- WheatMillington 9 months ago
  I can't find any.
caseyy 9 months ago
NFTs — neutral with 98% confidence. Hmm…
Also, it seems like putting in the same phrase twice generates different graphs and results at least sometimes. So it’s difficult to use comparatively.
grogenaut 9 months ago
it'd be great if I could turn off neutral, it's drowning out the positive and negative lines... for example Kubernetes... mostly neutral.
analog31 9 months ago
Out of curiosity, is the sentiment index weighted for up-voted threads and comments? Because those things are a kind of sentiment index too.
- Mockapapella 9 months ago
  It isn't, just straight text
wavemode 9 months ago
I searched Blockchain and it said "neutral". This is the sample comment it displayed:
> Ah yes, because blockchain is the 100% true source of ultimate truth.
The model can't detect sarcasm.
- philipwhiuk 9 months ago
  Turns out Poe's Law affects machines just as much!
brunocvcunha 9 months ago
Cool idea. But as a red/green color blind person, I have no way to figure out what is positive/negative in the graphs.
- Mockapapella 9 months ago
  Ach, sorry 'bout that. I added a red/green colorblind toggle for generating the graphs.
ryukoposting 9 months ago
Very cool until I downloaded the graph, went back to the main page, and Firefox Mobile crashed.
poniko 9 months ago
Nice idea, not sure it works though.. got neutral on all words I tried .. even beer
gotoeleven 9 months ago
>> white board interviews
nothing
>>hobos in san francisco
nothing
>>accommodating my neurodivergence
error
>>is everyone I don't like hitler or just some people
nothing
nhggfu 9 months ago
"I'd love your feedback on the tool's usefulness"
useless junk.
undefined 9 months ago
[deleted]
aliasxneo 9 months ago
Appears to be struggling from the hug of death.
- Mockapapella 9 months ago
  We're actively taking suggestions from people to improve it. The webserver was probably just restarting with the new suggestions :)
  undefined 9 months ago
  [deleted]
  aliasxneo 9 months ago
  How often is it doing that? I've tried a few times over the last ten minutes. The latest attempt ended with Cloudflare returning a 504.
  Mockapapella 9 months ago
  If that's the case maybe I'm seeing a caching thing on my side. I upsized the droplet some
bbor 9 months ago
Cute, love the idea! Would perhaps cut the examples to be substrings to avoid long comments that mention the issue in passing once, but that’s a personal preference. Either way, actually seeing the graph on mobile is wonky, and IMO the example is way less interesting than the graph — maybe make it bigger and more front n center?
Thanks for sharing :) TIL sentiment for “communism” is slightly less negative than for “capitalism” on here! Tho both are, surprisingly, mostly neutral.
- Mockapapella 9 months ago
  1. Comments have been truncated to 100 characters
  2. The graphs now go full screen when they are clicked
  3. Most comments will be neutral. This is kind of a quantitative way to see "the silent majority"
  There's some nuance that is lost due to it classifying the comment as a whole as opposed to specific aspects of it, but it works for the most part. Thanks for trying it out!
- lorddustingale 9 months ago
  Thank you for your feedback! We appreciate it.
  We’re actively working on making the tool more user-friendly and intuitive. As expected, most sentiment is neutral, but we plan to add a toggle view in future updates to enhance the experience.
thih9 9 months ago
Congrats on the launch, the idea sounds useful, the UI looks very intuitive and to the point. I like that I don’t have to log in and can try it immediately, big plus.
Bug report, I saw inaccurate results, I asked about “native apps” and I got negative sentiment. This is contrary to my experience, afaik HN loves them.
The example comment[1] quoted “non-native apps” and is part of the discussion where people say they don’t like non-native apps.
Edit: Then I asked about non-native apps, got sentiment “neutral” and this comment (the one I’m editing now) as the example. Very unexpected!
[1]: https://news.ycombinator.com/item?id=41366882
okasakii 9 months ago
Does not seem to work. Tried a few words including; Google, Apple, Microsoft and as final test ‘Genocide’. All seemed to return just ‘neutral’. The last one should have been definitely a negative sentiment.
WheatMillington 9 months ago
RTO and Elon Musk are both overwhelmingly neutral sentiment? Seems unlikely.
undefined 9 months ago
[deleted]
dangitman 9 months ago
[dead]
beepbooptheory 9 months ago
[flagged]
- rosmax_1337 9 months ago
  What were the negative comments about?
- IgorPartola 9 months ago
  Not sure why you are being downvoted. While race isn’t the main topic of this site, I think it is important to keep our inherent biases in mind and out of our decisions when we talk about tech, startups, education, etc.
  beepbooptheory 9 months ago
  Even if not important, its simply interesting in the anthropological sense. Can only laugh that these very downvotes will reinforce the point of the comment(s) itself.