This is cool, but you need to figure out how to add a free tier, for fewer than 5 boxes and 1 env or something like that. There's no way we would spend this money unless we were boiled like a frog in a pot and just went with it because we didn't want to go back.
And $500-1000/mo is way too much; that's more than a 10-person company spends every month on their entire CRM.
By pricing yourselves so high, what you're telling people is "hey, we're a startup and we don't expect to scale big," and what I'm thinking is "hey, maybe it's risky having our critical infra tied up with this startup."
Congrats on the launch!
I'm not a seasoned DevOps professional but I'm usually the one who ends up provisioning or setting up VMs, serverless stuff and DBs. I just don't understand the product.
You make reusable TF modules that have security and policies baked in. Engineers use a UI to hook up those modules, and Massdriver does the deployment work for you.
Sounds like a godsend for big teams, but I don't see pre-funded startups being able to afford a $500/mo fee. For funded ones that's very approachable, but their IaC problems wouldn't be as visible.
Honestly, in smaller teams you can get pretty far just setting things up through your cloud provider's web console and focusing on what you're building.
Since the fee is kind of steep, what's the justification for it? Is it that the workflow improvements significantly improve productivity, which justifies the cost, or is the service itself expensive to run and maintain?
Thanks! And ... you're right!
Massdriver isn’t aimed at pre-funded startups. Early-stage teams are often better off with a PaaS or setting things up manually until ops challenges become a bottleneck.
Our pricing (5-seat minimum) is intentional to dissuade smaller teams. The real value kicks in when teams need self-service. Ops teams build the modules (not us), and Massdriver acts as the interface. Developers diagram what they need, and Massdriver provisions using the ops team’s standards. This keeps developers focused on building while giving ops visibility and control over what’s deployed.
How do you intend to get any adoption with pricing like that? I’m certainly not going to drop several hundred dollars to see if your product is better than any of the freely available IaC tools.
Not trying to criticize, I just don't understand how this works. I got my company to pay for Pulumi after several years of usage, but I needed to be able to use it to get that far.
So does Massdriver become a single source of truth for all information about my company's entire cloud, or does it only keep track of what's been deployed through it?
I have a friend who's a manager at a large e-commerce company, whose team's entire responsibility is to oversee all matters regarding their private and public cloud usage. They also manage and maintain services for internal use.
I would love to recommend you guys to them because managing deployments from over a dozen teams located around the world is hell for them. However, they have an extensive private cloud setup; would your solution be as applicable to them as it is to companies running on public clouds?
If this is for the company I think it is for, they won’t use this as they have a very strong NIH culture.
Never heard of the phrase, "NIH culture" before. What does that mean?
That's "not invented here". HN could really benefit from a glossary of acronyms.
It is a single source of truth, but only for what is managed through the platform.
Private cloud isn't the best experience right now. It's possible, but it requires our platform being able to "get inside," so we either need a control plane exposed to us or a VPN connection in.
Self-hosted is our #1 requested feature, so we are cranking away at it. It's in alpha, and we're looking for testers/feedback. Would love an intro!
When you get self-hosted up and running, you may also want to consider open-sourcing the private cloud portion as well. Think of it more as a marketing thing. Many companies with <5 people tend to either go all-in on the cloud or run their own servers if bootstrapping. For teams on the cloud, they don't need your product yet. For teams running their own private clouds, they do. Eventually they'll grow into the cloud and bam, they start paying you.
It's a long game, but might be worth it.
Oh boy, I have so many questions...
* You want to simplify infrastructure, but there's a new learning curve here. Why did you decide to go with diagramming as a solution? What other methods did you evaluate and discard?
* How does an organization with existing infrastructure implement Massdriver?
* How do you handle edge cases, custom configurations, complex logic, etc.? For example, workflows that use custom scripts or some other form of band-aid.
* The visual approach could make it too easy to piece together infrastructure without understanding the implications. How do you prevent developers from creating poorly architected systems just because you make it simple to connect components?
* When things go wrong, how do developers debug issues at the infrastructure level? Do they reach out to ops?
> Why did you decide to go with diagramming as a solution?
I had a similar idea. I have enough experience with visual programming environments to be wary. Here are my thoughts on why it might be a good approach here:
* It would be possible to take a whiteboard scribble and turn it into a real system. Combining this with the services available in the cloud, you end up with something really powerful. It all comes down to the level of abstraction supported. You have to be able to draw boxes at a level that adds value, but also zoom in to parameters at the service/API level as necessary.
* I've worked on a team that was responsible for designing and maintaining its own AWS infrastructure. Along with that comes the responsibility for controlling cost. The idea of having a living architectural diagram that also reported cost in near real time is really helpful, especially if you could start to do things like project cost given a level of traffic or some other measure.
Once you have a decent library of TF modules, and an understanding of the networking and compute fundamentals, and an understanding of the services offered by your cloud provider, you have something really powerful. If a service can help accelerate that, it's worth it IMHO.
You have really hit the nail on the head with what we were going for! Cory and I, very early on said "We draw this stuff, agree on it, then go build it in TF which is where the problems start".
We imagined a world where you could go into architecture review and come out of that meeting with staging stood up and ready to run your application.
This makes sense for infra because it's mostly config management and API calls. Visual programming is rough because control structures are so hard to visualize.
I'm here for the questions!
> * You want to simplify infrastructure, but there's a new learning curve here. Why did you decide to go with diagramming as a solution? What other methods did you evaluate and discard?
We try to make it so both teams have to learn as little as possible. For the ops team, we build on the tools those teams are already familiar with: Terraform, Helm, Ansible, etc. Our extension model is also ops-oriented. You add add'l provisioners by writing Dockerfiles, you enforce pre-validations with JSON Schema (this is the best we could come up with, but we figured it was a safe bet ops-wise since it's part of OpenAPI). Devs don't have to learn the ops team's tools to provision infrastructure; they just diagram. Massdriver was originally a wall of YAML to connect all the pieces, but it felt fumbly (like everything else).
I wanted to make a VR version like something you'd see in a bad hacker movie, but Dave told me not to get ahead of myself. :D
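A toy sketch of the kind of pre-validation described above, hand-rolled in Python to stay dependency-free (the schema fields and allowed values are invented; in practice you'd author real JSON Schema and run it through a standard validator):

```python
# Hypothetical pre-validation for a database module's inputs, in the
# spirit of JSON Schema. The checks are hand-rolled here; a real setup
# would use a proper JSON Schema validator.

schema = {
    "required": ["instance_class", "engine_version"],
    "allowed_instance_classes": ["db.t3.micro", "db.t3.medium"],
    "engine_version_prefix": "14.",  # simplified stand-in for a pattern
}

def validate(params: dict) -> list[str]:
    """Return a list of validation errors (empty means the config passes)."""
    errors = [f"missing required field: {k}"
              for k in schema["required"] if k not in params]
    allowed = schema["allowed_instance_classes"]
    if params.get("instance_class") not in allowed:
        errors.append(f"instance_class must be one of {allowed}")
    prefix = schema["engine_version_prefix"]
    if not str(params.get("engine_version", "")).startswith(prefix):
        errors.append(f"engine_version must start with {prefix}")
    return errors

print(validate({"instance_class": "db.t3.micro", "engine_version": "14.7"}))  # []
print(validate({"instance_class": "db.r5.24xlarge"}))  # several errors
```

The point is that misconfigurations are rejected before any provisioner ever runs, rather than failing halfway through an apply.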
> * How does an organization with existing infrastructure implement Massdriver?
It depends on whether they have IaC or not. If they have IaC, they publish the modules. If their IaC has a state backend, it's usually just good to go; if they are using local files for state, we offer a state server they can push state into.
If teams don't have IaC, we run workshops on "reverse terraforming" or "tofuing" and also offer professional services to codify that stuff for you.
> * How do you handle edge cases, custom configurations, complex logic, etc.? For example, workflows that use custom scripts or some other form of band-aid.
As noted above, it's all based on common ops tooling. Let's say you wanted to use a new security scanning tool for IaC and we don't have it in our base provisioners: you can write a Dockerfile, build the image, and then include that scanning tool in any of your Massdriver configs. We also have folks doing day-2 operations with the platform, things like database migrations. The lines in the graph actually carry information and can push that info across different tools, so you can have Helm charts get information from a Terraform run. You can build a provisioner with, say, the psql tool or a Helm chart running Bucardo and use it to set up replication between an old and new Postgres instance.
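For instance, a custom provisioner image like the one described might look roughly like this (base image, scanner choice, and entrypoint layout are illustrative, not Massdriver's actual conventions):

```dockerfile
# Hypothetical provisioner image: extend a Terraform-capable base with an
# IaC security scanner (tfsec shown purely as an example).
FROM hashicorp/terraform:1.7

# Install the scanner alongside the provisioning tooling.
RUN apk add --no-cache curl \
 && curl -sSL -o /usr/local/bin/tfsec \
      https://github.com/aquasecurity/tfsec/releases/latest/download/tfsec-linux-amd64 \
 && chmod +x /usr/local/bin/tfsec

# Run the scan before handing off to the normal provisioning steps.
COPY entrypoint.sh /entrypoint.sh
ENTRYPOINT ["/entrypoint.sh"]
```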
> * The visual approach could make it too easy to piece together infrastructure without understanding the implications. How do you prevent developers from creating poorly architected systems just because you make it simple to connect components?
The lines and connections are actually a type system that you can extend (also based on JSON Schema). That way, ops teams can encode common things into the platform once, e.g., this is how we authenticate to Postgres: it's an AWS secret, security groups, and these IAM policies. All of that information flows across the line into the other module. The modules reject invalid types, so common misconfigurations _can't_ happen. It also lets you "autocomplete" infrastructure. Let's say I'm a dev and I want to deploy a database: I can drop it on the canvas, and since Massdriver understands the types, it'll automatically connect it to a subnet that dev has access to.
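A rough illustration of the typed-connection idea in Python (artifact type names and fields are made up, not Massdriver's real artifact definitions):

```python
# Each line in the graph carries an artifact with a declared type; a module
# only accepts artifacts of the types it declares. Everything here is a
# simplified stand-in for the real schema-based type system.

postgres_auth = {
    "type": "postgres-authentication",
    "data": {"hostname": "db.internal", "port": 5432},
}
subnet = {"type": "aws-subnet", "data": {"subnet_id": "subnet-123"}}

class Module:
    def __init__(self, name, accepts):
        self.name = name
        self.accepts = accepts        # artifact types this module can receive
        self.connections = []

    def connect(self, artifact):
        # Invalid connection types are rejected, so the misconfiguration
        # never reaches a deployment.
        if artifact["type"] not in self.accepts:
            raise TypeError(f"{self.name} does not accept {artifact['type']}")
        self.connections.append(artifact)

api = Module("api-service", accepts={"postgres-authentication"})
api.connect(postgres_auth)            # ok: types match
# api.connect(subnet)                 # would raise TypeError
```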
> * When things go wrong, how do developers debug issues at the infrastructure level? Do they reach out to ops?
They may, but we have a lot built in to make the system as truly self-service (through day 2) as possible. There are runbooks per module, so ops teams that have built a module around a use case can put in common troubleshooting steps, and it's all accessible from the same graph. Alarms and metrics also show up there. Ops teams can also publish day-2 modules to the catalog, so developers can drag and drop common one-off maintenance tasks onto their canvas and perform them.
> You add add'l provisioners by writing Dockerfiles, you enforce pre-validations with JSON Schema
That's really neat! Thank you for answering my questions and all the best with your launch!
Thanks, I appreciated the questions!
Really interesting approach to streamlining IaC. I was wondering, have you considered a free tier or trial option (maybe limited to a few boxes or a single environment) for early-stage teams? I'd love to play with it and maybe migrate our infra next quarter.
We have a 2-week free trial. That can be extended for you if you need a longer POC period. We also have some deals for early-stage companies we can talk through. Book a demo with us and let's talk through your use case!
As an SRE/DevOps/Ops type, I took a look because I've sat with salespeople from your competitors. It looks like a functional platform and another "Cloud defaults are too scary? Here is a sane default option."
>The hard part isn’t just technical, it’s socio-technical. Ops teams want control, devs want speed, and compliance creates friction between them.
You got that right and I'm not sure another tool is going to fix it.
I wouldn't say Ops wants control, it's we want to stop being paged after hours because Devs yolo stuff into production without a care in the world. Not sure tooling will fix that.
I love your (https://www.massdriver.cloud/blogs/devops-is-bullshit) blog article though. It prompted a great discussion.
> I wouldn't say Ops wants control, it's we want to stop being paged after hours because Devs yolo stuff into production without a care in the world. Not sure tooling will fix that.
You have hit the nail on the head here. Our base hypothesis is the only way to solve this problem is to start with a self-service approach. If I deploy an RDS instance and nobody ever connects to it, it will never have an issue. The moment a Dev starts firing N+1's at it, I have to get up at 1am. Developers need to have ownership and accountability for their infrastructure without having to become absolute cloud experts.
Our goal is to enable Ops teams to build catalogs of solid building blocks that developers can build into novel architectures and safely own and operate. The collaboration between Ops and Dev is delegated to software and eases this friction.
> It looks like a functional platform and another "Cloud defaults are too scary? Here is a sane default option."
I would push back on this notion. An Ops team builds reusable modules that match their reliability and compliance requirements. You _can_ use modules we have created but we expect that you own your IaC modules. They will conform and evolve with your organization's best practices.
The DevOps is bullshit article is the inspiration for making a platform that manages the relationship between Dev and Ops which I think separates us from our competitors in the space.
> Developers need to have ownership and accountability for their infrastructure without having to become absolute cloud experts.
This will never happen. You can’t own something you don’t understand.
Ops and Dev are different roles for a reason, and the only reason we’ve shifted away from that is to accelerate profits; yes, you can spend your way to growth, and yes, you can run massively complex systems on hardware you have never seen, nor understand. That doesn’t make it a good idea.
I am unsure what you are getting at. Our platform is all about Ops and Dev being separate jobs, but operating at a greater scale. Ops produce modules that can be consumed by Developers. Developers control the scale of their infra while Ops encodes security and compliance in the module. It seems like this would be the ideal model in this instance. If it is not, what does the ideal look like in your opinion?
Discussed a couple times here:
DevOps Is Bullshit (2022) - https://news.ycombinator.com/item?id=36354049 - June 2023 (278 comments)
DevOps is broken - https://news.ycombinator.com/item?id=33274988 - Oct 2022 (348 comments)
This is quite interesting.
What does a seat entail? You talk about self serve (I love it!), but would the users that self-serve take up a seat? Or are seats just for the folks creating the modules?
Seats are for any user of the UI and they aren't bound to a person, so you can reassign them as necessary.
For larger teams, we do buckets of seats with per-seat price reductions as team size goes up.
We don't rate limit deploys - as that would make you less agile, we don't charge based on resources because that penalizes teams with stable infrastructure.
We know ops budgets can be tight, we're ops engineers, so we try our best to make the pricing work well for those budgets.
Having watched and participated in an online call (mentioned in another comment here), there were two big things that were strikingly cool about MassDriver:
* It turns the connections between components owned by different IaC tools/systems into _typed_ JSON. Think patch panels, where the connections are fully typed.
* Kelsey mentioned that you can introspect on the metadata in the live system using absolutely anything you want, down to just bash scripts. So it's _very_ hackable
Hey! How's it been? I followed you on bsky a few days ago!
Glad you liked it! Yeah, the typed connections are a big part of what makes Massdriver powerful. It makes sure infrastructure components integrate right without devs having to worry about all the low-level details.
We're expanding the graph metadata with a querying system, coming out of alpha soon, that lets you ask things like "Where are all my t3 instances in production?" or "Which services are using a Kubernetes version below 1.25?" It makes it way easier to understand what's running where.
And since it’s all API-first, it’s easy to write quick scripts for reporting or automating changes across environments.
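As a sketch of the kind of script that enables, here's the filtering side in Python, with the "API response" inlined as plain data since the actual endpoint and response shape aren't described in this thread:

```python
# Hypothetical shape of instance metadata returned by a graph query API;
# the fields and values are invented for illustration.
instances = [
    {"name": "api",    "env": "production", "instance_type": "t3.medium"},
    {"name": "worker", "env": "staging",    "instance_type": "t3.small"},
    {"name": "cache",  "env": "production", "instance_type": "r5.large"},
]

# "Where are all my t3 instances in production?"
t3_in_prod = [
    i["name"]
    for i in instances
    if i["env"] == "production" and i["instance_type"].startswith("t3")
]
print(t3_in_prod)  # ['api']
```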
Isn’t that something you’d easily get out of any IaC tooling dashboard?
I’ll admit a graph query makes it easier, but the information is there.
There are probably some dashboards that can give you that information easily. Where having the graph is interesting is scripting something that can test changes to a widely used module. With all of your config behind an API you could pull the live config of every instance of that module, run your change against it, and then trigger a deployment of each one of the modules in isolation with the new code. The least interesting part of the API is search.
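The workflow described here could be sketched like this (every function is a stand-in for a hypothetical API call; the control flow is the point, not the client):

```python
# Sketch: pull the live config of every instance of a module, plan a change
# against each, and deploy only the instances whose plans look safe.

def list_module_instances(module):          # stand-in for an API call
    return [{"id": "db-prod", "config": {"version": "1.2"}},
            {"id": "db-staging", "config": {"version": "1.1"}}]

def plan_change(instance, new_version):     # stand-in for a dry-run/plan call
    changes = {"version": (instance["config"]["version"], new_version)}
    return {"instance": instance["id"], "changes": changes, "destructive": False}

def deploy(instance_id):                    # stand-in for a deploy trigger
    print(f"deploying {instance_id}")

plans = [plan_change(i, "1.3") for i in list_module_instances("postgres")]
for plan in plans:
    if not plan["destructive"]:             # only roll out non-destructive plans
        deploy(plan["instance"])
```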
What we've developed that is similar is a meta-layer above terraform to take high-level intent and use that to generate terraform which is then committed to git and auto-actuated.
This is nice because it's still all using git, and you can continuously merge in updates as the 'best practice' definitions in the intent get updated. It allows teams to maintain their own customizations and eventually promote them up to the intent level to capture them as a best practice so other teams can benefit from them.
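A minimal sketch of that meta-layer idea (the intent fields, size mapping, and module source are invented for illustration):

```python
# A high-level intent is expanded into Terraform that can be committed to
# git; re-rendering after the defaults change lets teams merge in updates.

INTENT = {"name": "orders-db", "kind": "postgres", "size": "small"}

# Hypothetical best-practice defaults, maintained centrally.
SIZE_MAP = {"small": "db.t3.micro", "large": "db.r5.xlarge"}

def render_terraform(intent):
    return (
        f'module "{intent["name"]}" {{\n'
        f'  source         = "git::https://example.com/modules/{intent["kind"]}"\n'
        f'  instance_class = "{SIZE_MAP[intent["size"]]}"\n'
        f'}}\n'
    )

print(render_terraform(INTENT))
```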
I love the idea of high-level intents and abstractions. They’re tough to get right, but when they click, they make dev teams move way faster.
Cloud APIs mix operational and developer concerns, which is part of the problem. A single API call for something like a database forces you to think about instance types, failover strategies, and security settings when all a developer really cares about is, "I need a database that can handle X traffic and supports Y extensions."
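As a toy example of that split (thresholds and mappings invented for illustration), the developer states intent and the operational details are derived:

```python
# The dev says "a database that handles X traffic and supports Y extensions";
# instance type, failover, and the rest come from ops-defined policy.

def database_request(peak_qps, extensions):
    instance_class = "db.t3.medium" if peak_qps < 1000 else "db.r5.large"
    return {
        "instance_class": instance_class,        # derived, not chosen by dev
        "multi_az": peak_qps >= 1000,            # failover policy from ops
        "shared_preload_libraries": list(extensions),
    }

print(database_request(200, {"postgis"}))
```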
I’m actually working on a write-up about abstractions for a CNCF infra white paper and would love to get your thoughts. A lot of teams struggle with the balance between standardization and flexibility, and it sounds like you’ve thought a lot about this too. Let me know if you’d be up for a chat.
Also, here’s a post I wrote about it recently:
https://www.massdriver.cloud/blogs/the-case-for-abstractions...
Wanted to check your status page to get a feel for stability. Is the status icon in your footer hard-coded, or is there an actual status page?
It hits our public health check. We don't have a dedicated page for our different systems, but this is on our radar.
Is there any way for me to submit a module or co-author one with you guys for my cloud service?
Yeah, there definitely is. Ping me @ cory @ massdriver.cloud
Pet peeve, but I hate it when people name something lame after something much cooler. Same problem with Ansible, Raytracter, etc. It feels like you're compensating for how lame the product is.
I love all of the contextual metadata, huge potential here
You should know: AVG flags the site for malvertisement, one of the API calls I think. You might want to do a site scan to make sure.
Thanks for letting us know! We will look into that.
damn thought it'd be a space launch platform
If I become a billionaire from this, that's my next startup.
Just an FYI: If I select "Features" / "API First" on your page, I end up at a 404.
Thanks, restructured the docs site recently. Updating now!
How much do you have to pay Hightower and Majors to show up on the About Us page?
Kelsey is a technical advisor to MassDriver. He introduced them on this live chat: https://www.youtube.com/watch?v=LKmKxKafdS0
fwiw, the part in that demo where they add one type of object and it automatically adds the necessary related objects, because the inputs and outputs are typed, is :chefs-kiss: