April 2021

Featured Post

How the technology gap grew during 2020

Even before the emergence of a global pandemic that disrupted and catalysed modern workplace IT, IT professionals were under pressure. In a few years, many have learned to deliver more new technologies and advanced services than perhaps in the previous decade. Today's hybrid, distributed tech environments already pressured even senior admins to equip themselves with new skills to adapt to these changes and manage systems confidently.

Five worthy reads: Is DataOps the next big value driver in the analytics ecosystem?

Five worthy reads is a regular column on five noteworthy items we’ve discovered while researching trending and timeless topics. In this edition, we’ll learn about DataOps, an interesting methodology that can help organizations fast-track their data analytics operations.

Product Training - Beyond Infrastructure Map & Monitor Critical Applications

SquaredUp’s Lead Solutions Engineer, Ashley Thompson, covers Enterprise Applications in depth, including availability tests, link monitoring, status messaging, and infrastructure mapping, and how to utilise Enterprise Applications to inform your service desk and beyond.

NiCE VMware Monitoring Adds Value 2021Q2

Virtualization is part of many IT environments and a very effective way to reduce expenses while boosting efficiency and flexibility. VMware monitoring using the NiCE VMware Management Pack for Microsoft SCOM enables you to ensure maximum performance and availability of your VMware vSphere and ESXi environments. The NiCE Management Pack enables insight beyond the virtualization layer and discovers how the virtualization configuration impacts your application services and end-user experience.

The Application Blame Game - New Survey Reveals Troubling Trends in IT

Studies consistently show that a positive UX (user experience) drives revenue growth, repeat business and brand loyalty. Here’s a good example: in Robert Pressman’s book Software Engineering: A Practitioner’s Approach, he writes “For every dollar spent to resolve a problem during product design, $10 would be spent on the same problem during development, and multiply to $100 or more if the problem had to be solved after the product’s release.”

Guide to using Docker for your CI/CD pipelines

Docker is a platform for developers and sysadmins to develop, deploy, and run applications using containers. Docker is also referred to as an application packaging tool. This means that enabled applications can be configured and packaged into a Docker image that can be used to spawn Docker containers that run instances of the application. It provides many benefits including runtime environment isolation, consistency via code, and portability.

SquaredUp 5.1 is here

We are delighted to announce that SquaredUp 5.1 is now available! With this latest update, we are introducing new integrations and visualizations that extend the picture of your business services and applications by unlocking even more of your data that is trapped within silos. You can now get insights on your enterprise applications from any angle! These features are available in all our products, including our newest product Dashboard Server.

Free ITIL v4 training from ITIL 4 co-author Barclay Rae | ITIL4 implementation in non-IT areas | ESM

Learn about #ITIL4 and #enterpriseservicemanagement from one of the ITIL 4 architects and co-author on how to extend the scope of ITIL 4 beyond IT and best practices to benefit your organization. With the developments in ITIL 4 in recent years, organizations are now appreciating the real meaning of value “co-creation,” and the need for collaboration across organizations. So there is a lot of development and collaborative work taking place to synthesize and integrate work, tasks, and value streams across departments, teams, and functional groups—all well beyond IT.

Explainer Video: Splunk for Infrastructure Monitoring and Troubleshooting

Wherever you are in your cloud journey and whatever your environment looks like, Splunk can monitor the performance of all your servers, containers and apps in real-time. Get real-time observability for data from any cloud, any vendor, and any service. Try our free Infrastructure Monitoring Trial and see for yourself.

Importance of Data Security For Mobile Healthcare Apps

Telemedicine or mobile healthcare apps are unquestionably important in our daily lives, as the world is increasingly shifting to an all-digital landscape. And when you think of the current pandemic scenario when social distancing and wearing masks are the new normal, nothing seems to be safer than using mobile health apps. But are we really safe while using these apps? Well, that brings us to the concern of being safe on the internet while using mobile healthcare apps.

See Inside the Datadog Platform

Datadog offers a single unified platform to monitor your infrastructure, applications, networks, security threats, UX, and more. For full visibility, you can seamlessly navigate between metrics, traces, and logs. Built-in machine learning tools, clear visualizations, and a companion mobile app make it easy to monitor growing environments. See inside any stack, any app, at any scale, anywhere.

End-to-End Observability Drives Great Digital Experiences

Mike Cohen, Splunk’s head of product management for network monitoring, joins theCube’s John Furrier for a conversation about how networks are an untapped source of data to help your organization achieve observability — and how to unlock that potential. Why understanding data flow and service interactions is key to understanding your systems Why distributed systems can cause extra troubleshooting issues — and what you need to know to fix them through network performance monitoring

Keeping Watch Over Microservices and Containers

Splunk Director of Product Management Craig Hyde joins theCube’s John Furrier for a conversation in the Leading With Observability series. They discuss the importance of digital experience monitoring, especially as the world sees a boom in remote, online business and increasingly complex technological infrastructures. Why starting with the end user in mind is critical for setting observability goals How full-fidelity end-end tracing impacts troubleshooting, to detect and alert in seconds

Under the Hood With Splunk Observability

Splunk Distinguished Architect Arijit Mukherji joins theCube’s John Furrier for a conversation about the value of having a holistic view of observability — and the right solutions — to help you achieve your business goals. Signs that your tool sprawl is becoming a big problem in dealing with the inherent complexities of modern IT environments Why full-fidelity ingest can be an observability superpower How real-time streaming analytics can improve MTTI and MTTR

Network Observability for Distributed Services

Mike Cohen, Splunk’s head of product management for network monitoring, joins theCube’s John Furrier for a conversation about how networks are an untapped source of data to help your organization achieve observability — and how to unlock that potential. Watch this segment of Leading With Observability on theCube to learn about addressing the gaps in your visibility, including: The ins and outs of monitoring metrics, distributed tracing and correlating logs with no management complexity

Un-Excuse-ing Upgrades

When we talk about upgrades here at SolarWinds, we spend a lot of time discussing the beneficial features, performance, and capabilities you can gain. That’s not by accident. The honest-to-goodness truth is, the most compelling reason to upgrade ANYTHING—from our phone to our game console to our monitoring software—is because we’ll be able to do something both new and useful to us.

How to build a CI/CD pipeline with Docker

I talk with many of my fellow engineers at conferences and other events throughout the year. One thing I like demonstrating is how they can implement a continuous integration/continuous deployment (CI/CD) pipeline into a codebase with very little effort. In this post I will walk through some demo code and the CircleCI config that I use in the demonstration. Following these steps will show you how to implement CI/CD pipelines into your code base.

Dashbird becomes Gartner Cool Vendor 2021!

We’re officially cool! Dashbird is extremely proud to be named as a Cool Vendor by Gartner in Monitoring, Observability, and Cloud Operations in their 28 April 2021 report on “Cool Vendors in Monitoring, Observability and Cloud Operations”. “Dashbird provides a novel approach to observability for serverless applications that run inside an AWS environment.

A guide to website uptime monitoring with UptimeRobot

Your website is your primary storefront on the internet and any website issues can lead to customer dissatisfaction and lost business. Which is why it is important to monitor your website to make sure that it is working properly. In this guide, we will learn how to set up website uptime monitoring with UptimeRobot.

Interlink Software and AppDynamics deliver unified, data-driven Service Visualization and faster fault resolution.

We are delighted to share news of our partnership with leading, real-time Application Performance Monitoring (APM) vendor Cisco AppDynamics and are now a fully-fledged member of their Integration Partner Program (IPP.) For our mutual enterprise customers service affecting issues can lie undetected in the vast volumes of data generated by the multiple, disconnected tools used to monitor their multi-cloud environments, applications and technical solutions.

How To Improve Workflow Productivity And Operational Efficiency?

Did you know that an average worker spends approximately 3 hours per 8 hour workday in personal or unproductive work? A significant amount of time is lost in repetitive or administrative tasks, which do not aid in increasing project productivity. Instead, they adversely impact employee satisfaction and efficiency by robbing substantial project time. As such, improving workflow productivity and efficiency has become a key challenge for organizations today.

Five Reasons To Choose Dell and Robin Cloud Native Platform For AI/ML (Blog series - Part 3 of 3)

In part 1 and part 2 of this series, we examined how AI/ML can help improve healthcare and the challenges faced by AI/ML teams in realizing the benefits respectively. In this part, we will explore how Robin and Dell can help overcome these challenges.

Announcing support for the AWS managed Lambda Layer for OpenTelemetry

Datadog’s support of OpenTelemetry—a vendor-agnostic, open source set of APIs and libraries for collecting system and application telemetry data—has helped thousands of organizations implement monitoring strategies that complement their existing workflows. Many of our customers leverage OpenTelemetry for their server- and container-based deployments, but also need visibility into the health and performance of their serverless applications running on AWS Lambda.

Incident Response Alert Routing

You have identified a data breach, now what? Your Incident Response Playbook is up to date. You have drilled for this, you know who the key players on your team are and you have their home phone numbers, mobile phone numbers, and email addresses, so you get to work. It is seven o’clock in the evening so you are sure everyone is available and ready to respond, you begin typing “that” email and making phone calls, one at a time.

Monitoring for Success: What All SREs Need to Know

The last ten years have seen a massive change in how IT operations and development enable business success. From virtualization and cloud computing to continuous delivery, continuous integration, and rapid application development, IT has never been more complex or more critical to creating competitive advantage. To support increasingly Web-Scale IT operations and wide-scale cloud adoption, applications now operate as services.

Seamless Cloud account management - The Future of Qovery - Week #8

During the next two weeks, our team will work to improve the overall experience of Qovery. We gathered all your feedback (thank you to our wonderful community 🙏), and we decided to make significant changes to make Qovery a better place to deploy and manage your apps. This series will reveal all the changes and features you will get in the next major release of Qovery. Let's go!

A Guide to AWS Certifications

If you’re interested in cloud computing, AWS certifications are one of the most rewarding paths to a dynamic career. As a worldwide leader in cloud infrastructure service, Amazon prepares certified experts who are highly sought after by IT organizations around the world. Did you know that 94% of organizations use a cloud service and 30% of their IT budgets are allocated to cloud computing?

JFrog Expands APAC Presence To Support Growing DevOps Adoption

At JFrog, we’ve seen DevOps and DevSecOps adoption growing robustly in Asia-Pacific (APAC), as the region’s large enterprises recognize the competitive advantage and importance of DevOps and digital transformation. In fact, by 2025, up to 25% of Asia’s 500 largest companies will become software producers to digitally transform and maintain their A500 status, IDC predicts1.

Various Types of Software Bugs - Vol1

It’s been said that every line of code is a liability. We have to write code to create software, but every time we do we create the possibility of software bugs—or “bugs” for short. But what is a bug exactly? A bug is something that is either entirely wrong or not quite right with software. It can lead to wrong outputs or behaviors, or even crash the whole system.

7 Ways SRE Is Changing IT Ops And How To Prepare For Those Changes

SRE best practices are disrupting and catalyzing change in the ways organizations approach IT Operations. In this blog we look at 7 ways SRE is bringing this transition. ‍Site Reliability Engineering is a new practice that has been growing in popularity among many businesses. Also known as SRE, the new activity puts a premium on monitoring, tracking bugs, and creating systems and automations that solve the problem in the long term.

Covea Insurance drives customer service innovation with ServiceNow

The insurance sector is incredibly competitive and tightly regulated, so being able to provide great customer service is vital to success. And having the right technology is imperative to delivering quality service. At Covéa Insurance, the UK arm of France’s top mutual insurance group, we provide commercial, motor, high-net worth, property, pet, and protection insurance to more than 2 million customers.

How to monitor Microsoft SQL Server with Prometheus

In this article, you will learn how to monitor SQL Server with Prometheus. SQL Server is a popular database, which is very straightforward to monitor with a simple Prometheus exporter. Like all databases, SQL Server has many points of failure, such as delays in transactions or too many connections in the database. We are basing this guide on Golden Signals, a reduced set of metrics that offer a wide view of a service from a user or consumer perspective.

Key Kubernetes Metrics and Resources to Monitor for Peak Cluster Performance

Monitoring is not easy. Period. In our guide to Kubernetes monitoring we explained how you need a different approach to monitoring Kubernetes than with traditional VMs. In this blog post, we’ll go into more detail about the key Kubernetes metrics you have access to and how to make sense of them. Kubernetes is the most popular container orchestrator currently available. It’s available as a service across all major cloud providers. Kubernetes is now a household name.

How to Monitor Zoom Network Performance | Obkio

Zoom’s popularity has skyrocketed over the past year. It’s not only an application that we use for convenience, but for many of us, we rely on it for everyday conversation VoIP Quality and unified communication applications, like Zoom, can be drastically impacted by poor network performance. So monitoring network performance helps you identify performance issues & improve your Zoom performance.

What's new in Grafana Enterprise Metrics 1.3, our scalable, self-hosted Prometheus service

We built Grafana Enterprise Metrics (GEM) to empower centralized observability teams to provide a multi-tenanted, horizontally scalable Prometheus-as-a-Service experience for their end users. The GEM plugin for Grafana is a key piece of realizing this vision. It provides a point-and-click way for teams operating GEM to understand the state of their cluster and manage settings for each of the tenants within it.

Threat Hunting Frameworks and Methodologies: An Introductory Guide

Creating an effective threat hunting program is among the top priorities of security leaders looking to become more proactive and build active defenses. Yet finding the right expertise to staff a hunt team remains a challenge, with 58% of respondents in a recent SecOps survey saying they felt their organization’s investigative skills and capabilities were in need of improvement.

6 Steps to Getting Started With Observability

During my office hours, I frequently get asked for practical tips on getting started with observability. Often it’s from folks on teams who are already practicing continuous delivery (or trying to get there) and are interested in more advanced practices like progressive delivery. They know observability can help—but as individual contributors—they don’t sign the checks, so they feel powerless to help get their team started with observability.

Continuous integration for React applications using Jest and Enzyme

React continues to be the web framework of choice for many UI developers, second only to jQuery, according to Stack Overflow. It provides an intuitive model for building data-driven user interfaces, and efficiently updates the DOM when this data changes. React pairs nicely with Redux, which enables managing the data that React needs to render interfaces. Redux offers a predictable way to structure and update the data in those frontend applications.

Dashboard Server: Working with the SQL tile

In my previous blogs in the Dashboard Server Learning Path, we looked at working with the Web API tile and the PowerShell tile. In this instalment, let’s try the SQL tile. This tile will let you connect to any SQL database and run a SQL query straight from SquaredUp. This tile is also available in both the SquaredUp for SCOM and Azure products, so I have some familiarity with it already.

Learning to Learn by Teaching DevOps | Kat Cosgrove on 99 Percent Visible

I'm a Developer Advocate. That means that ultimately, my job is to teach people things. Over the last year and some change, I've given dozens of talks and workshops about DevOps, the majority of them educational in nature. The way I approach developer education has changed pretty radically over that period of time – more difficult for me in some ways, but better for my audience in every way. The assumptions I make are different now, and the way I communicate has changed, too.

Instana Customer Case Study: Immocloud Unlocks Observability to Modernize Real Estate

Immocloud is a new online solution that makes managing real estate more efficient by fully digitizing processes between tenants and landlords. Users of Immocloud are able to provide restricted access to tax and bank consultants, as well as a variety of other necessary landlord partners, such as cleaning services and maintenance. A variety of digital services are offered through Immocloud, including banking integrations, document management, as well as landlord-to-tenant communication tools.

Accelerating DevOps Using Cloud Native Technologies With AWS, Docker & JFrog

In this webinar we help you gain a deeper understanding of the benefits of migrating and modernizing applications from a monolithic architecture to microservices, in order to accelerate DevOps processes. We outline the efforts required to reach this stage of sophistication in application development and deployment.

The essentials of central log collection with WEF and WEC

Last week we covered the essentials of event logging: Ensuring that all your systems are writing logs about the important events or activities occurring on them. This week we will cover the essentials of centrally collecting these Event Logs on a Window Event Collector (WEC) server, which then forwards all logs to Elastic Security.

Your Cloud Optimization Questions Answered! Here are our FAQs

We created a survey to ask our consumers about their cloud to gain insight into how we can provide the best possible solution to our customers. 270 people responded in the tech community of all different job titles and cloud providers. This survey gave us lots of valuable insight into how companies are using the cloud for their business and their pain points, which is where we come in.

Logz.io and the AWS Distro for OpenTelemetry

Amazon Web Services has announced enhanced support for the open-source distribution of the OpenTelemetry project for its users. AWS Distro for OpenTelemetry (ADOT) now includes support for AWS Lambda layers for the most popular languages and additional partners integrated into the ADOT collector. And one of those partners is Logz.io! Logz.io is happy to announce that our exporter is now included in the AWS Distro for OpenTelemetry.

Announcing Calico Enterprise 3.5: New ways to automate, simplify and accelerate Kubernetes adoption and deployment

We are thrilled to announce the availability of Calico Enterprise 3.5, which delivers deep observability across the entire Kubernetes stack, from application to networking layers (L3–L7). This release also includes data plane support for Windows and eBPF, in addition to the standard Linux data plane. These new capabilities are designed to automate, simplify and accelerate Kubernetes adoption and deployment. Here are highlights from the release…

How NOT to take a side project to startup - Dev Matters S1E03

What side projects make terrible startups? What should you avoid when trying to make the transition? In this episode of Dev Matters, Don and his guest Dylan Etkin discuss side projects and lessons learned taking Sleuth from a side project to a startup. This episode was recorded in front of a live studio audience on Twitch.

How To - Monitor Split Tunnel Traffic with Catchpoint

When the world transitioned to a remote workspace, one of the things that most of us figured out quickly was that some applications just don’t work well with corporate VPN. Video and voice applications, like Microsoft Teams, are essential to business operations. I wouldn’t want to add another point of failure that I’d need to troubleshoot if I didn’t have to.

Extinguishing our performance fires and rebuilding for the future

I stepped into the role of Head of Engineering for Bitbucket Cloud in late 2020, having served as one of the team's senior engineering managers for several years. It is an honor and a privilege to lead this team, and I couldn't be prouder of the hard work we've done and continue to do each day to make Bitbucket a world-class product empowering teams to build, test, and deploy software to millions of people around the world. It has been an eventful journey, and the past few weeks are no exception.

4 Major Capabilities of Automated Incident Management

Automated incident management ensures that critical events are detected, addressed and resolved in a fast, efficient manner. Automation allows incident management tools to integrate with each other and fosters instant communication across the systems. Automation tears down barriers across IT operations (ITOps) teams and ensures all departments are on the same page. Teams gain full visibility into incident status to verify that incidents are addressed by the relevant groups.

How to Improve Kubernetes Management and Administration with LogDNA

In this video, we will show how LogDNA helps DevOps teams using Kubernetes to consume, control and collaborate with logs. By providing value to data from every source, including Kubernetes, developers are empowered to leverage logs to ensure they can continue to accelerate development cycles, and Ops teams can easily onboard microservices teams without the need to modify their infrastructure.

Q&A from the Moogsoft/Datadog Fireside Chat

On April 15th Moogsoft’s VP Marketing, John Haley, welcomed Datadog Product Manager, Alex Vetras, along with DevOps Institute Chief Ambassador, Helen Beal, and Moogsoft’s CTO, Dave Casper, for an informal roundtable exploring how users can now see rich-context incidents from across the full stack in minutes, and the opportunities this presents to organizations.

Fireside Chat with Jesse Robbins and Kolton Andrus Failover Conf 2021

Long before Chaos Engineering was even a phrase, Jesse Robbins was Amazon.com's "Master of Disaster" using intentional failure to help the company become more reliable. Kolton Andrus (CEO at Gremlin), sits down with Jesse to learn more about his early work with GameDays, the evolution of reliability, and where the future of SRE lies.

Fireside Chat with Ines Sombra and Ana Medina Failover Conf 2021

Reliability is a requirement for the modern internet. Ana Medina joins Inés Sombra, Sr. Director of Engineering at Fastly, to discuss their approach to resilience, how the past year has influenced the way they work, and what practices your engineering organization can adopt to become more reliable.

The Case for Cost Containment With Professional Services and Staff Augmentation

At the start of our enterprise cost containment series, Leon examined what cost containment is and why IT pros should care. Basically, you should care how your projects and work affect the business’s bottom line and should understand enough about it that you can communicate effectively with management. Even though you may not think about the business in this way most of the time, it directly affects your job.

The IT Skills Gap-A Downside of Innovation

Innovation is widely accepted to be a great thing—think of all the new products, technologies, methodologies, services, etc. unveiled at any given time. At this point, you’re probably thinking, "This all sounds great! Why would someone be writing about a downside of innovation?" Innovation is great when it pushes the boundaries of what can be achieved and inspires people to build upon things others have built or dreamed of. But innovation is useless without adoption.

Search for files and document contents in Mattermost

Finding the right information in Mattermost is critical to work smarter and be more productive. Searching in Mattermost now finds both relevant messages and files in your team’s conversation history. Search will return results for attachments that match the file name or contain matching text content within supported document types. File search is available today in Mattermost Cloud and in Mattermost Self-Managed v5.35 (available May 16), with mobile support coming soon.

What Are AWS Lambda Triggers?

This is a basic introduction to Lambda triggers that uses DynamoDB as an event source example. We talk a lot about the more advanced level of Lambda triggers in our popular two-part series: Complete Guide to Lambda Triggers. If you want to learn more, read part one and part two. We’re going back to the basics this time because skipping some steps when learning something new might get you confused. It tends to get annoying, or it can even make you frustrated. Why?

Using Maths to Fight Financial Crime

Financial crime has become a red-hot topic over the last 12 months, as fraudsters have sought to exploit the monitoring gaps between people, process and technology across an ever-widening attack surface – driven by the growth in usage of remote (digital) channels. Even before its recent growth, the cost of fraud and financial crime was significant.

Atlassian Open DevOps Overview Video

Software and DevOps teams have everything they need to develop, ship, and operate software in Atlassian Open DevOps, an development experience built on Jira Software.Open DevOps starts with Jira Software, Confluence, Bitbucket, and Opsgenie. Teams can easily add the tools they want, such as GitHub or GitLab, with a single click. In this video get an overview of Open DevOps and how it can supercharge your development.

Security Starts With People, Unfortunately.

One of the findings of our investigations pointed to a compromised account as a possible cause of the cyberoperation. While I was reading the results, I thought, “That could have been me. In fact, it could have been any of our employees,” and I began asking myself what I as an individual could do to increase the security of the company I’m working for. Let’s face it, most of the risk is produced by us humans and our behavior.

Using Coralogix to Gain Insights From Your FortiGate Logs

FortiGate, a next-generation firewall from IT Cyber Security leaders Fortinet, provides the ultimate threat protection for businesses of all sizes. FortiGate helps you understand what is happening on your network, and informs you about certain network activities, such as the detection of a virus, a visit to an invalid website, an intrusion, a failed login attempt, and myriad others. This post will show you how Coralogix can provide analytics and insights for your FortiGate logs.

JFrog and PagerDuty Extend Ecosystem Integration

JFrog and PagerDuty have deepened their technology integration to further boost IT operators’ and developers’ visibility into the software development lifecycle and accelerate incident resolution. The latest integration, which involves the JFrog Pipelines DevOps pipeline automation solution, simplifies and streamlines how to identify faulty builds that impact production environments.

Centralized Log Management for Multi-Cloud Strategies

The future of enterprise IT stacks is the cloud. In fact, according to a 2019 Gartner post, when we say “cloud infrastructure,” 81% of people really mean multi-cloud. Considering the analyst took this survey prior to the pandemic, we can safely assume that the number of companies with multi-cloud stacks is probably higher than this. Companies choose a multi-cloud strategy for a lot of reasons, including making disaster recovery and migration easier.

Are Software Developers Ready for Real-Time Payments?

Well, it’s official. The COVID-19 pandemic dramatically accelerated the digital economy, and there are no signs of letting off the gas. According to the U.S. Commerce Department, the COVID-19 pandemic resulted in a 44% increase in eCommerce sales from 2019 to 2020. Individuals increasingly prefer contactless payment methods – think holding up your phone to the credit card terminal – as a way to prevent the spread of coronavirus.

Have Your Say in the new Idea Portal

We’re excited to announce the launch of the all-new idea portal. A place where great ideas can grow, build support, and help shape the future of Auvik. The idea portal allows us to collect your suggestions while keeping you informed of what we’re working on, and what we’re planning to implement next. We can’t wait for your input!

Building CI/CD pipelines using dynamic config

Creating robust, manageable, and reusable functionality is a big part of my job as a CI/CD engineer. Recently, I wrote about managing reusable pipeline configuration by adopting and implementing pipeline variables within pipeline configuration files. As I showed in that tutorial, pipeline variables and orbs have added some flexibility to this process, but they are still a bit limited.

Introducing dynamic config via setup workflows

With the new release of dynamic config via setup workflows, CircleCI customers can now use jobs and workflows, not only to execute work but to determine the work they want to run. We built dynamic config because we know our users want more dynamism in the CircleCI build process. Historically, our platform has been very deterministic: the config is pre-set in a file based on the revision for a given pipeline.

Exploiting and detecting CVE-2021-25735: Kubernetes validating admission webhook bypass

The CVE-2021-25735 medium-level vulnerability has been found in Kubernetes kube-apiserver that could bypass a Validating Admission Webhook and allow unauthorised node updates. The kube-apiserver affected are: You are only affected by this vulnerability if both of the following conditions are valid: By exploiting the vulnerability, adversaries could bypass the Validating Admission Webhook checks and allow update actions on Kubernetes nodes.

Our $188M funding round fuels our mission to help customers confidently run modern cloud applications

Today, I am excited to share that we secured $188M in a new funding round, at a valuation of $1.19B (read more here). At the outset, I want to thank our employees, partners, investors and most importantly, our customers for this important milestone. The funding follows a year of unmatched innovation that led to accelerated revenue growth, installed base growth, and rapid community adoption of our open source projects.

The Simple Formula To Calculate SaaS Gross Margin

As the Software as a Service (SaaS) industry continues to grow, finding ways to win on gross margin becomes increasingly important. By finding sustainable strategies that either increase revenue or limit the cost of goods sold (COGS), SaaS companies can gain a competitive advantage and continue pursuing their broader growth objectives. While all SaaS brands want to be “profitable”, broadly speaking, there are many different ways to define what profitability actually means.

Get instant Grafana dashboards for Prometheus metrics with the Elixir PromEx library

I have been using Grafana for almost four years now, and in that time it has become my go-to tool for my application observability needs. Especially now that Grafana allows you to also view logs and traces, you can easily have all three pillars of observability surfaced through Grafana. As a result, when I started working on the Elixir PromEx library, having Grafana be the end target for the metrics dashboards made perfect sense.

Introducing 2-way REST capabilities with Enterprise Alert 9

The REST API in Enterprise Alert 9 has now been extended with a 2-way functionality. This allows to call webhooks or REST endpoints from third party systems on alarm status changes (acknowledge, close). Thus, in Enterprise Alert 9, it becomes child’s play to establish a 2-way integration with almost any REST enabled third party system.

Enhanced customer experience: A conversation with Rogers Communications

The success of a modern business starts with recognizing the value of the customer and committing to provide a great experience. The best companies do this by putting respect and empathy at the core of every interaction. In today’s new era of work, delivering an enhanced customer experience has become increasingly connected with the employee experience. This creates new opportunities for organizations to drive stronger satisfaction for both parties simultaneously.

The Future of Work - Two Scary Statistics from Aternity's Webinar with 451 Research

Aternity recently sponsored a webinar with 451 Research, part of S&P Global Market Intelligence, on the future of work. Two scary statistics on the impact of end-user technology on employee engagement, satisfaction, and productivity were presented. This blog covers how Aternity helps organizations address the challenges of hybrid work environments.

Searching through logs with the free and open Logs app in Kibana

Log exploration and analysis is a key step in troubleshooting performance issues in IT environments — from understanding application slow downs to investigating misbehaving containers. Did you get an alert that heap usage is spiking on a specific server? A quick search of the logs filtered from that host shows that cache misses started around the same time as the initial spike.

Agent installation options for Google Cloud VMs

Site Reliability Engineering (SRE) and Operations teams responsible for operating virtual machines (VMs) are always looking for ways to provide a more stable, more scalable environment for their development partners. Part of providing that stable experience is having telemetry data (metrics, logs and traces) from systems and applications so you can monitor and troubleshoot effectively.

New functionality for modifying server parameters in Pandora FMS

This video shows the new editor located in the Pandora FMS web console to be able to modify some parameters of the server configuration file. Do you need to monitor your services but you have less than 100 devices? In this video we will show you the available options Pandora FMS Lite 35 and Lite 70.

Leaving the Nest: Guidelines, guardrails, and human error by Laura Santamaria Failover Conf 2021

When we talk about reliable systems, we talk a lot about human error. Human error in an incident or a bug report is often treated with a bit of a facepalm reaction. The term masks a lot of scenarios from accidents to exhaustion to everything in between. However, human error helps us understand where our processes failed and how we can prevent the same error from happening again. In short, we need to think in terms of a framework of guidelines and guardrails. In this short talk, let’s discuss how guidelines like runbooks and guardrails like automation can help us address the fact that everyone will, at some point, make mistakes.

Implementing DevSecOps in the DoD by Nicolas Chaillan Failover Conf 2021

Delivering software quickly and securely is important for every organization, but it's even more important at the US Department of Defence (DoD) where reliability directly impacts national security. Nicolas Chaillan (Chief Software Officer, US Air Force) will discuss the DoD Enterprise DevSecOps Initiative—an initiative he leads along with the DOD’s Chief Information Officer that brings automated software tools, services and standards to DoD programs. He'll also share about Platform One, the Air Force's DoD-wide DevSecOps Enterprise Level Service that provides managed IT services capabilities, on-boarding, support, and baked-in zero trust security. This insight from operating at the most rigorous level will help you level up your own organization.

Pragmatic Incident Response: Lessons learned from failures by Robert Ross Failover Conf 2021

Incident response is overwhelming. So where do you start? There's a lot of advice out there, but it's mostly theories that aren't taking reality into account. So how do you get a process in place that actually works and scales? In this session, FireHydrant CEO and Co-Founder, Robert Ross, will share quick stories from his experience as an SRE and what tips he’s learned along the way.

Whats Next for DevOps by Emily Freeman  Failover Conf 2021

For over a decade, the DevOps movement has been using cultural change to power technological transformation and help companies deliver better products faster and more reliably. While many organizations have embraced this change and reaped the benefits, it hasn't come without challenges and many more remain. In this session, Emily Freeman (author of DevOps for Dummies) shares what's next for DevOps and how it will impact your organization.

The Evolution of Observability and Monitoring panel discussion Failover Conf 2021

Observability and monitoring are critical to detecting and troubleshooting problems to build more reliable applications. As our systems become increasingly complex, our tools for getting this crucial visibility and the way we respond need to evolve too. We'll sit down with SRE leaders to discuss the processes they use to get the most insight into their applications, how they've increase the speed of detection and response, and what organizations need to do to stay on top of growing complexity.

The Evolution of Teams & Culture panel discussion Failover Conf 2021

The most successful organizations are the ones that embrace change and use it to become stronger and more resilient. In this panel discussion, we'll talk with engineering leaders about how they adapted to the challenges of 2020, what successes (and failures) they've seen, and where the future of reliable engineering teams is headed.

How an Experience Level Agreement can Benefit your Business

The success of a business is dependent on two key components: a quality product/service that is being offered and a team that can market and communicate about that product/service effectively. However, that team needs to first be able to communicate with each other to brainstorm and strategize. With many businesses still working on a remote or hybrid model because of the global pandemic, digital communication has become an invaluable part of productivity.

Atlassian Open DevOps and Codefresh

Codefresh is excited to partner with Atlassian on their new Open DevOps launch. Codefresh is offering native support for connecting the two platforms and giving better visibility on deployments and features of each deployment for our mutual customers. At the heart of this integration is the Codefresh App; which can be found on the Atlassian Marketplace. Simply define Codefresh as the CI/CD partner that will connect to Atlassian’s DevOps API.

Quick Demo: Ivanti UEM for Clients

Some of the many ways that malware, including ransomware, is commonly spread is through malicious attachments to business email, unsanctioned apps downloaded from third-party app stores, drive-by downloads via phishing and pharming attacks, employing brute-force tactics using Remote Desktop Protocol (RDP), and network propagation via SMB and CIFS sharing.

Mobile Devices are Ubiquitous, and so are Cyberattacks

We all like to enjoy untethered freedom, as is shown by the incredible growth of mobile devices we use every day for business and personal activities. We use mobile devices for buying products and services, and banking and investing. We download apps that allow us to connect with our favorite businesses and socially interact with friends and relatives.

SIEM Deployment Remains Strategic in 2021

Some great new research coming out of the survey data published by 451 Research on Enterprise spending for Information Security. There have been more advanced ways of trying to implement security controls and avoid security issues by integrating security into the development or continuous integration and release pipelines. Despite that, there is still strong interest in using log and event data to manage the security posture of an organization in a SIEM solution.

Integrate security into development with Snyk, now a seamless part of Bitbucket Cloud

While PR analysis within Code Insights and Snyk Pipes are available to use right now, we're rolling out a native Security tab in Bitbucket Cloud. This will be a gradual rollout through the month of May so watch out for it in the left nav. We look forward to your feedback. Even small vulnerabilities can cost a team a lot. All too frequently we see news reports of organisations that mishandled their code & build level security, causing customer data to be exposed.


Focalboard is an open source, self-hosted alternative to Trello, Notion, and Asana. It's a project management tool that helps define, organize, track and manage work across teams, using a familiar kanban board view. We're currently in early-access beta, and are looking for feedback. So please download it today and let us know what you think. Focalboard is open source! Check out the source code here, and contribute to the future of this project.

Mattermost plugins: How to set up your developer environment

The goal of this four-part series is to help you learn how to write your own Mattermost plugins for the first time. To kick things off, this article teaches you how to set up your developer environment. My test computer is a five-year-old laptop with an Intel i5 processor and 4GB of RAM. You need at least 30GB of hard disk for this project. Of course, you’ll also need an internet connection. We start with a freshly installed Ubuntu 20.04. You don’t need to install the desktop environment.

Four Key Challenges To Adopting AI/ML In Healthcare (Blog series - Part 2 of 3)

In part 1 of this series, we examined how AI/ML can help improve healthcare. AI/ML is an ambitious undertaking that promises to revolutionize healthcare. Getting excited is easy, but where do you start and why is it not just another empty promise? In fact, despite all these promises and futures, most AI/ML projects fail and don’t deliver. The failure rate of AI/ML projects is starting to make some wonder if this is real or hype.

Fostering Exceptional Microsoft 365 User Experiences

Enhanced visibility is crucial and to best meet current business needs requires an understanding of the level of satisfaction when using Microsoft 365. There is a growing demand to learn and know how users feel about the quality of their experience. Take a deep dive into the difference between Service Level Agreements and Experience Level Agreements and why the enhanced visibility is crucial to best meetyour current business needs.

Monitor cloud endpoint health with Datadog's cloud service autodetection

Your modern cloud-hosted applications rely on a number of key components—such as databases and load balancers—that are managed by the cloud provider. While these cloud resources can reduce the overhead of maintaining your own infrastructure, capturing and contextualizing monitoring data from services you don’t own can be difficult.

Launching RMM Central: A unified IT solution for managed service providers

We’re pleased to introduce ManageEngine RMM Central, a unified remote monitoring and management solution. Maintaining the IT infrastructure and systems of client networks is a herculean task for IT service providers. Multiple tools perform various capabilities in network management, be it maintaining or managing workstations, laptops, servers, and other networks.

Diagnosing latency: Lightstep vs. Jaeger

For many organizations starting out with distributed tracing, Jaeger is often the first tool used to ingest and visualize traces. It provides a way for developers to query for individual requests and see their behavior as they traverse all the services and operations to complete the request. As powerful as this is, it only provides a partial picture of your system’s performance because you are only able to visualize individual requests or at most compare two requests to each other.

DNS Load Balancing for Highly Available Enterprise WordPress Cluster

Scalability, high availability, and performance are fundamental to the success of a commercial product deployment. And if the workload includes multiple entry points for requests, it is important to properly set up smooth load balancing in order to achieve the required uptime and speed. One of the solutions in this case is to use a DNS service with origin servers health checks.

Five Reasons to Use Catchpoint for Measuring Core Web Vitals

We are in this together. As part of our continuous efforts to meet customer expectations, we have recently added Core Web Vitals to our performance measurement programs. We are happy to share that these metrics are now a native part of the Catchpoint Platform. DevOps’ SREs, Platform Operations Engineers, and business and monitoring strategists alike will realize a series of key benefits from this addition.

GKE operations magic: From an alert to resolution in 5 steps

As applications move from monolithic architectures to microservices-based architectures, DevOps and Site Reliability Engineering (SRE) teams face new operational challenges. Microservices are updated constantly with new features and resource managers/schedulers (like Kubernetes and GKE) can add/remove containers in response to changing workloads. The old way of creating alerts based on learned behaviors of your monolithic applications will not work with microservices applications.

What is detection engineering?

Just as threat actors evolve their attacks and techniques, so too must security teams evolve their detection content. Detection engineering, therefore, is a life cycle that requires continual effort. However, when done well, detection engineering can reduce the mean time to detect and respond to a threat, as well as recover from a threat. Detection engineering is the process of identifying threats before they can do significant damage.

Uncover How Your Employees Experience Their SaaS Applications in Real-Time

With employees depending on web applications every day, you can’t risk leaving anything to doubt when it comes to managing your IT estate. Although technology performance might appear “in the green” from IT’s perspective, how often are employees experiencing application outages or slowdowns you’re not aware of? Are they using that highly touted new app you rolled out – or avoiding it because of hidden usability problems?

What's Changed in VMware vSphere 7 Update 2: All You Need to Know

VMware has recently released vSphere 7 Update 2, and there is a lot of new stuff to look out for. vSphere, VMware’s server virtualization product, has been an industry favorite for a long time. The vSphere 7 came out in April 2020, and this is so far the second update to it, hence the name. When you look at the changes they’ve rolled out, you’ll know that they are really focusing on some key areas. As a result, VMware infrastructure is getting pretty solid and modern.

Getting Started with the Splunk Distribution of OpenTelemetry Java

Splunk Distro for OpenTelemetry is a secure, production-ready, Splunk-supported distribution of the OpenTelemetry project and provides multiple installable packages that automatically instruments your Java application to capture and report distributed traces to Splunk APM (no code changes required!), making it easy to get started with distributed tracing!

Integrating a Cloudsmith repository with a Harness CD pipeline

In this blog, we will walk through the process of configuring a private Cloudsmith repository as an artifact source for a Harness Continuous Deployment pipeline. Harness is a Continuous Deployment platform that allows you to easily automate the deployment of your software to your infrastructure and environments.

Announcing Services Discovery for tracking and improving service reliability

Gremlin helps teams proactively improve the reliability of their systems by running chaos experiments on infrastructure including hosts, containers, and Kubernetes clusters. But as microservice-based architectures and automated cloud platforms become the norm, engineers are shifting their focus from managing infrastructure to managing services. In order to keep these services as resilient as possible, they need tools that can help them find failure modes, reduce incidents, and improve availability.

Managing and troubleshooting Elasticsearch memory

Hiya! With Elastic’s expansion of our Elasticsearch Service Cloud offering and automated onboarding, we’ve expanded the Elastic Stack audience from full ops teams to data engineers, security teams, and consultants. As an Elastic support rep, I’ve enjoyed interacting with more user backgrounds and with even wider use cases.

Benchmarking Grafana Enterprise Metrics for horizontally scaling Prometheus up to 500 million active series

Since we launched Grafana Enterprise Metrics (GEM), our self-hosted Prometheus service, last year, we’ve seen customers run it at great scale. We have clusters with more than 100 million metrics, and GEM’s new scalable compactor can handle an estimated 650 million active series. Still, we wanted to run performance tests that would more definitively show GEM’s horizontal scalability and allow us to get more accurate TCO estimates.

How our Field Teams' Productivity Skyrocketed with our New AIOps Studio

Lately, I have seen fewer call outs from our field teams to our solution engineering team, and I was wondering what could be the reason? Sometimes, our field engineers approach our solution engineering team with advanced requests for data analysis, running what-if scenarios and assessing the quality of data and what new value can be gleaned by combining related datasets.

DevOps: Post-Mortem Reporting for Enhanced Operations

Migrating to a DevOps model of culture merges Development and IT Operations. This powerful unity enables organizations to release updates in smaller increments at a greater frequency. While this model reduces the risk of failure of an individual release and reduces the system development life cycle, it does not, however, necessarily decrease the number of incidents technicians need to respond to.

9 Best Cloud Logging Services for Log Management, Analysis, Monitoring & More [2021 Comparison]

Log management stopped being a very simple operation quite some time ago. Long gone are the “good old days” when you could log into the machine, check the logs, and grep for the interesting parts. Right now things are better. With the observability tools that are now a part of our everyday lives, we can easily troubleshoot without the need to connect to servers at all. With the right tools, we can even predict potential issues and be alerted at the same time an incident happens.

Improve Your CMDB for Business Outcomes with Application Dependency Mapping

A configuration management database (CMBD) is a centralized repository that stores information about all the significant entities in your IT environment. These can include your hardware, installed software applications, documents, business services, and even the people who are part of your IT system. The CMDB is designed to help you maintain and support the interrelationships between the configuration items (CIs) within a vast IT structure.

How Can Companies Integrate Ethical AI? | Splunk's Ram Sriharsha & Dr. Rumman Chowdhury

Organizations use AI to be more competitive, deliver better business outcomes and avoid falling behind. However, business leaders should know they pose their organizations’ serious risk if they do not comply with ethical standards. Leadership must enable teams to practice ethical business strategies, up-level talent strategy, and enable organizational resilience. Dr. Rumman Chowdhury and Ram Sriharsha, Head of Machine Learning at Splunk, discuss the challenges companies will face if they do not comply with ethical standards and how to solve for fairness and privacy.

How to deploy an application on Friday

No one likes giving their weekends up to fix release issues. Developers and operations teams are traditionally hesitant to make changes or deploy applications on a Friday, in case something goes wrong and they have to spend their weekend making emergency fixes. Or worse, trying to roll back changes that were made. However, with a strong set of practices and a reliable deployment pipeline, there should be no reason why a deployment cannot happen anytime — even on a Friday afternoon.

Chaos Engineering in 60 seconds - Attack a service

Learn how to run a chaos experiment on a distributed service using Services Discovery in Gremlin. Gremlin is the enterprise Chaos Engineering platform on a mission to help build a more reliable internet. Their solutions turn failure into resilience by offering engineers a fully hosted SaaS platform to safely experiment on complex systems, in order to identify weaknesses before they impact customers and cause revenue loss.

Announcing Services Discovery for tracking and improving service reliability

Gremlin announces Services Discovery for tracking and improving the reliability of distributed services. Gremlin is the enterprise Chaos Engineering platform on a mission to help build a more reliable internet. Their solutions turn failure into resilience by offering engineers a fully hosted SaaS platform to safely experiment on complex systems, in order to identify weaknesses before they impact customers and cause revenue loss.

15 Minutes to Beat the Pandemic and Get Your Patients Back to Work

It seems like the country is getting back on its feet, with millions of COVID-19 vaccinations being distributed every day. But unfortunately, we are not in the clear yet. 2020 was an unprecedented, challenging year, but it is crucial to remember that in 2021 we are in the pandemic’s fourth wave. According to an article by AARP, a dangerous “factor is pandemic fatigue.

The Future of Technology Operations with Charlie Betz of Forrester Research

Recently, we had the opportunity to hear from Charlie Betz of Forrester Research about the future of technology operations. He covered the shift from project-based work to product team orientation, the integration between DevOps and IT services and the ways in which the entire enterprise is getting value from investments in artificial intelligence. Check out some of the interesting questions and answers from the discussion and be sure to catch the full conversation in the video below.

GitOps Use Cases You May Not Have Considered

GitOps is growing in popularity. You’ve probably seen it mentioned on Reddit or dev.to. But what the heck is GitOps? Broadly speaking, GitOps takes the principles of Git and CI-powered workflows favored by software developers — commonly used to automate the process of building, testing and deploying software — and applies them to other business processes.

ServiceNow named a Magic Quadrant Leader in Enterprise Agile Planning Tools

ServiceNow has been named a Leader in the 2021 Gartner Magic Quadrant for Enterprise Agile Planning Tools for the first time. This is a big move for us after being positioned a Niche Player in 2019 and a Challenger in 2020. We believe this position is a result of our extended integration with third-party tools, our project and portfolio management (PPM) use cases, and our portfolio management capabilities.

Test Azure Service Bus Performance by Generating a Million Test Messages

For the people using Azure Service Bus namespaces – we often have the need to ensure the Azure Service Bus Performance by testing our system integration by generating some test messages on the Azure Service Bus resources. You might need this for QA/Development for performance testing, load testing etc. This blog will explain how to simulate the test environment using Serverless360 to check Azure Service Bus performance and its throughput.

Introduction to cron job monitoring with Healthchecks

Software teams use cron jobs to handle many important tasks like database backups and maintenance scripts. Cron jobs make sure that your applications are behaving as they should, but cron job failures are often silent and not noticed until the problem becomes worse. In this guide, we will learn how to stay aware about cron job failures by using Healthchecks.

What is Site Reliability Engineering [Simple Intro to SRE]

Wondering what SRE is all about? We will explain what it is, how it works, why it was developed, and how it can help your organization. So what is SRE (Site Reliability Engineering)? SRE is a methodology that fuses software and operations teams, with the goal of producing reliable, resilient, and scalable systems. Site Reliability Engineering (SRE) was developed by Google engineer Ben Treynor Sloss in 2003. Google’s goal was to increase the reliability of its sites and services.

OpenStack CentOS alternatives:7 reasons to migrate to Ubuntu

Looking for OpenStack CentOS alternatives after recent changes in the CentOS project? Think Ubuntu – the most popular Linux distribution for OpenStack deployments, after CentOS, across development and production environments. Wondering what makes Ubuntu different? Here are seven reasons you should consider Ubuntu when planning your CentOS migration.

Why VPS Plans are Cheaper than Shared Hosting

You might be looking around for a new web host, or trying to get a better deal. You notice VPS plans are incredibly cheap - as low as $3-5 per month, while the cheapest shared hosting is around $10 per month. You wonder, "How is this possible?!" - especially when people recommend moving to a VPS once a site becomes popular. Let's discuss what these are first, before going into why cheaper doesn't necessarily mean better for you and your business.

Elastic and Alibaba Cloud: Reflecting on our partnership and looking to the future

Alibaba Cloud is an important partner to us here at Elastic. We officially started our collaboration and strategic partnership with Alibaba Cloud back in 2017, when we announced the Alibaba Cloud Elasticsearch service. Since then, we’ve seen rapid adoption and growth of the service, which now supports more than 10 petabytes of data.

Autoscaling with the HAProxy Kubernetes Ingress Controller and KEDA

One of the greatest strengths of containers is the ability to spin more of them up quickly. As the volume of traffic to your application increases, you can create more application containers on the fly to handle it, in almost no time at all. Kubernetes ships with autoscaling baked in, giving you the power to scale out when the system detects an increase in traffic—automatically!

How database DevOps can enable the evolving insurance landscape

In 2020, Deloitte reported on The four trends that define insurance and showed that the future of the insurance marketplace is going to be significantly different. Life and Property and Casualty insurers, for example, estimated that 93% of their volume already came from propositions that were not offered five years ago. New propositions were expected to keep on rising, with nearly a quarter of investment spend in insurance allocated to new product development.

Deploying Services with Docker, NGINX, Route 53 & Let's Encrypt

Docker is a power tool for deploying applications or services, and there are numerous Docker orchestration tools available that can help to simplify the management of the deployed containers. But what if you are wanting to deploy a small number of services and not wanting to undertake setting up and managing another application stack just to run a handful of containers. I will cover how I deployed a handful of services on a single Docker host.

Top 7 Tools for Adding Web Forms to Static Websites

The power of the Internet and the World Wide Web is known to everyone. Within a few years after its inception, businesses started to take advantage of all the facilities in features. And within no time, e-commerce became prominent as a new way to do business. Nowadays, it is the dominant way any company or business can reach its customers across the globe with a website.

How PayIt, a secure cloud service provider for digital government, uses Grafana and Prometheus for observability at cloud native scale

A trip to the DMV — and a realization that there had to be a better, more modern way for the system to work — sparked the idea for PayIt, a secure cloud service provider for digital government that launched in 2013. The company’s mission is to help state, local, and government agencies reach their constituents better and more effectively, shifting the reliance from in-office payments to digital ones.

Splunk App for Amazon Connect: End-to-End(point) Visibility for an Optimal Customer Experience

How do you ensure a customer experience (CX) that leaves both participants of a conversation not just satisfied, but elated afterwards? And how do you do that, thousands of times over the course of a day and millions of times a year?

Splunk and Zscaler Utilize Data and Zero Trust to Eradicate Threats

The past year has challenged us in unimaginable ways. We kept our distance for the greater good, while companies faced the daunting task of transforming their workforce from in-person to remote — practically overnight. This presented a unique challenge for cybersecurity teams. How would they ensure employees retained access to critical data in a secure way? Working in the cloud has made remote work easier for many organizations, but has also presented new risks.

How Artificial Intelligence Enhances Customer Service Management?

Throughout the world, the business and service sectors thrive on the best customer service management practices to ensure customer retention, boost customer sentiments to increase the profitability and branding of their business. But amidst the emerging technological advances, AI (Artificial Intelligence), as it is well known, is turning the table for businesses in winning the game of gaining customer trust and loyalty.

10 Simple AWS Hacks That Will Make You Super Productive

Useful AWS hacks and tricks that will save you time and money. If you work a lot with AWS, you probably realized that literally, everything on AWS is an API call; hence everything can be automated. This article will discuss several tricks that will save you time when performing everyday tasks in the AWS cloud. Make sure to read till the end. The most interesting one is listed at the very end 😉

Vaccinating Against the Ransomware Pandemic

Just as our physical world has suffered from a devastating health pandemic, over the last 3 years the digital world has been suffering a ransomware pandemic that grew 7 times in 2020 compared to the same time in 2019. While we have vaccines for our health crisis what are our options to help protect us against this digital threat?

Application Performance Management 101

To stay alive and growing, tech organizations are always on the hunt for ways to increase the quality and stability of their applications and services. Doing so is essential if they want to prevent their customers from becoming their competitors’ customers, after all. This post is all about a specific process organizations can—and should—use to increase the availability and performance of their offerings and delight their customers: application performance management (APM).

How to Optimize Website Performance

In a 2019 study from Milliseconds Make Millions by Fifty-Five and shared on Google’s official blog found several interesting insights on small speed increases. 37 brands qualified for study, after qualitative checks, with speed data measured via Google Lighthouse and aggregated against each brand’s Web analytics. The study targeted four key speed metrics. The results were fed into a Logarithmic Regression model to extract meaning.

Hybrid IT and Virtualized Workloads Preparing for a Shift to Microsoft Azure

Though government agencies continue to move to the cloud to accelerate their digital transformation plans, the majority have embraced a hybrid IT environment. This mix of on-premises and cloud implementations highlights the need for comprehensive, full-stack visibility across the entire hybrid IT environment. Without a broad view, agencies may not be able to see their cloud environment as clearly as they see what’s in the data center.

Workflowing a better world

I’m a big believer in leaving things better than I found them. I apply that mentality to work, home, and all the places in between. This planet is pretty remarkable, as are the approximately 7.6 billion people who share it. So, it’s no surprise that I want to play a part and create lasting change that leaves our world more sustainable, more equitable, and more ethical for everyone.

5 Digital Transformation Mistakes Infrastructure Leaders Make

Senior IT leaders, motivated by both the changing nature of our economy and more recently, the COVID pandemic, have decisively shifted their focus toward applications. The industry catchphrase for this shift, digital transformation, makes clear its dual nature: directed toward the digital future while at the same time acknowledging that the existing environment must be modernized—in other words, transformed. Tasked with enabling this new breed of applications are operations groups.

The State of Observability 2021: Key Findings

We at VMware Tanzu recently published our first-ever summary of the current state of observability. The main goal of our research was to uncover the key trends in observability adoption by hearing directly from IT practitioners, including DevOps teams, SREs, application architects, and their managers. We also wanted to understand what’s driving the popularity of observability and what the organizational impact of deploying observability is.

N-central Integration with Microsoft Intune

Configure Microsoft Intune device policies and get a clear view of all your customers’ devices right from your N-central dashboard. Manage your Microsoft Intune devices right from your N-able™ N-central® dashboard. Harmonize configurations and policies for all selected devices—including Intune devices—from centralized status dashboards in one place. Generate consolidated reports for devices managed by both Intune and N‑able.

Client Onboarding Checklist (Why Easy Onboarding Matters)

A client onboarding checklist will help to make client onboarding easy for both parties. There is no reason why client and customer onboarding has to be a laborious process. Utilizing the power of workflow automation tools like StartingPoint, you can transform your onboarding process into a streamlined workflow designed to enhance customer experiences. Have you considered enhancing customer experiences with an improved onboarding process?

Featured Post

How should start-ups court software talent?

There is a game of 'speed dating' going on between technology businesses and the software engineering talent that bring amazing solutions to market. In recent years big tech companies, expanding aggressively in Europe, have competed ferociously with locally headquartered tech startups for the best software engineers. These engineers are in short supply. A government-supported Tech Nation report disclosed that 10 per cent of all UK job vacancies were in tech. The report suggested that at current growth there could be 100,000 job openings per month before the end of June this year.

8 Utmost eCommerce Security Threats and Solutions of 2021

Last year, during the Magecart attacks saga, I was constantly flooded by calls from e-entrepreneurs – all sounding worried and anxious. The attacks were brazen, and the digital trust was taking a beating. After the initial shock waned, the entire conversation shifted gears. I still remember having long discussions with my clients about eCommerce security threats and their solutions. It now focused solely on building a reliable security system.

Dashboard Server: Working with the PowerShell tile

Amongst all the cool features of SquaredUp Dashboard Server, the coolest kid on the block is probably the PowerShell tile. The reason is simple – PowerShell is easy, it’s awesome, and it’s powerful! You can not only retrieve data from the source (like the APIs), but you can also manipulate that data, work with variables, loop it, filter it, and use it in whichever way works the best. Like they say, the things PowerShell can do are only restricted by the proficiency of the user.

Woopra: Your End-to-End Customer Journey Analytics Companion

Customers interact with your business multiple times before reaching any goal. These repeated digital interactions are what make up the customer journey. Your customers’ overall experience across the different channels as they engage with your organization (websites, social media, email, etc.) make up the customer experience. Customer journey analytics refers to the process of analyzing the experience of customers across multiple touchpoints in the customer journey.

Monitoring DNS Performance The Right Way With Catchpoint

The Domain Name System (DNS) is at the core of the engine that keeps the internet running. We have explained how DNS works and why it is critical to the functioning of the internet in our Synthetic Monitoring Guide. The DNS resolution relies on various components, such as the DNS resolvers, name servers, authoritative servers, and zone files, to function properly and the process typically takes milliseconds to complete.

Can I Send an Alert to Discord?

This is a great question. The answer is yes. You can send Graylog alerts via email, text, or Slack, and now Discord. Yes Discord! The growth and use of Discord has transformed from just many Gaming users to businesses using it as a communication platform. Many businesses like: Gaming Developers, Publishers, Journalists, Community and Event Organizers use Discord. Discord lets Gamer Developers work in teams with each other on their projects.

How to Build a Scalable Prometheus Architecture

When building distributed, scalable cloud-native apps containing dozens or even hundreds of microservices, you need reliable monitoring and alerting. If you’re monitoring cloud-native apps in 2021, there’s a good chance you’ve chosen Prometheus. Prometheus is an excellent choice for monitoring containerized microservices and the infrastructure that runs them — often Kubernetes.

Fintech AI/ML on Ubuntu

The financial services (FS) industry is going through a period of change and disruption. Technology innovation has provided the means for financial institutions to reimagine the way in which they operate and interact with their customers, employees and the wider ecosystem. One significant area of development is the utilisation of artificial intelligence (AI) and machine learning (ML) which has the potential to positively transform the FS sector.

Five Ways Containerized eSBCs Optimize Teams, Zoom & Other Cloud Communications Deployments

Enterprises are using Unified Communications as a Service (UCaaS) solutions like Microsoft Teams and Zoom, and Contact Center as a Service Solutions (CCaaS) like Five9 and Genesys to improve communications, simplify operations, and accelerate IT agility. As the COVID-19 outbreak clearly demonstrated, UCaaS and CCaaS solutions are ideal for delivering enterprise communications services to remote workers, mobile users, and small/home offices.

Cloud Migration Strategy [Guide]

Migrating to cloud infrastructure is one of the most critical requirements for modern enterprises to ensure long-term sustainability. Initially, there was a general apprehension about adopting cloud technology and developing a cloud migration strategy; certain aspects like security and resilience were concerns. But today, with increasing technological advancements and familiarity, there is not an iota of doubt that the advantages of adopting cloud technology have far outnumbered the few drawbacks.

How to monitor your web application availability

How do you execute an effective web application availability monitoring? All stakeholders should monitor to ensure that web app’s availability is not compromised. Great design and excellent user experience are put to waste if your web app is not accessible. Let’s establish first how web application monitoring works.

What is KFServing?

TL;DR: KFServing is a novel cloud-native multi-framework model serving tool for serverless inference. KFServing was born as part of the Kubeflow project, a joint effort between AI/ML industry leaders to standardize machine learning operations on top of Kubernetes. It aims at solving the difficulties of model deployment to production through the “model as data” approach, i.e. providing an API for inference requests.

FireHydrant April 2021 Product Updates: Incident Tags & Customizable Slack Incident Modals

We're excited to announce the release of two new features this month: customizable Slack incident modals and Incident Tags. Keep reading to more about how they can help your teams manage incidents better!

4 Characteristics of Monitoring Essential to Implementing DevOps

In the new world of rapid releases, continuous change, and increasingly high user expectations, more organizations are embracing DevOps. One of the primary drivers for adopting DevOps is speed — particularly the reduction of risk at speed. As DevOps seeks to reduce risk and deliver insight at an increasingly faster pace, new tools have emerged in the monitoring space. But these tools alone will not deliver us into the low-risk world of DevOps — not without new and updated thinking.

Five Ways Containerized eSBCs Optimize Teams, Zoom & Other Cloud Communications Deployments

Enterprises are using Unified Communications as a Service (UCaaS) solutions like Microsoft Teams and Zoom, and Contact Center as a Service Solutions (CCaaS) like Five9 and Genesys to improve communications, simplify operations, and accelerate IT agility. As the COVID-19 outbreak clearly demonstrated, UCaaS and CCaaS solutions are ideal for delivering enterprise communications services to remote workers, mobile users, and small/home offices.

Using Dokku On DigitalOcean

Dokku can be a cost-effective, convenient way to deploy apps to DigitalOcean. SolarWinds® Papertrail™ can make monitoring the logs of those apps simple and frustration-free. Combine these two technologies and you have an effective deployment process and log management system. Let’s look at Dokku first. Dokku is an open-source platform-as-a-service (PaaS). If you’re familiar with Heroku, you can consider Dokku a private Heroku that you manage.

A DBA's Habits for Success: CMMI (Part One)

Finding the perfect flow for your business can take time and patience. Like almost anything else in life, a business must go through stages of maturity before it reaches its final form and only requires regular maintenance. In this five-part series, we’ll dive into Capability Maturity Model Integration (CMMI) and what phases businesses and their DBAs must go through to successfully manage IT as a business.

Email Infrastructure Monitoring Checklist

A lot of time and resources are invested in making sure your customers get your emails. This is where email infrastructure comes in handy. While you have limited control over user interaction with your emails, monitoring email infrastructure is in your hands. Email infrastructure usually consists of your server and domain configuration, server performance, IP address, mail agents, and more. And to make sure your email infrastructure is in perfect working order, you need to constantly monitor it.

New Gartner AIOps Platform Market Guide Shows More Use Cases for Ops and Dev Teams

Gartner jumps right into it, describing a reorientation of a tool that has previously focused on IT service management and automation. AIOps is now also enabling a variety of new observability use cases for DevOps and Site Reliability Engineering (SRE) teams. This blog presents the guide’s major findings and a link so you can read the report for more details. About the AIOps Platform Market

What Comes After Kubernetes?

You probably can’t believe I’m asking that question. It’s like showing up to a party and immediately asking about the afterparty. Is it really time to look for the exit? No…but yes. We used to deploy apps on systems in data centers. Then we moved the systems to the cloud. Then we moved the apps to containers. Then we wrapped it all in Kubernetes for orchestration, and here we are. Each advance in technology unlocks doors we couldn’t reach before.

Software Engineering Daily Podcast

Large portions of software development budgets are dedicated for testing code. A new component may take weeks to thoroughly test, and even then mistakes happen. If you consider software defects as security issues then the concern goes well beyond an application temporarily crashing. Although even minor bugs can cost companies a lot of time to locate the bug, resolve it, retest it in lower environments, then deploy it back to production.

Securing Azure SQL Database, Part 3: Service Endpoints

In previous installments of my “Securing Azure SQL Database” series, I covered Azure SQL Database firewall rules and private endpoints—the first of which is a way to help reduce the public exposure of your database endpoint and the second being a means to remove all public access if necessary. Each option has unique benefits, and some scenarios might call for a mix of the two options.

Dynamic Observability: Troubleshooting Techniques for 2021

A new generation of troubleshooting techniques are making their way into the mainstream. These techniques make observability more dynamic, configurable, and intuitive. In this webinar, we discussed the importance of these new techniques, how they enable you to solve customer issues faster and increase your velocity.

CI/CD Pipeline Security 101

In our previous post, we discussed the recent security incident at Codecov and the following investigation at Mattermost. As a follow-up to that we wanted to share some of the basic design principles as well as a handful of more technical tips and tricks around CI/CD pipeline security that helped Mattermost come out of the incident unscathed.

Security threats affecting your workforce productivity? Here's how Forrester says you can address them

According to Forrester, the number of permanent, full-time remote workers is expected to increase by 300 percent or more compared to pre-pandemic numbers. Amid the chaos surrounding the pandemic, getting your entire workforce back to the office seems trivial, especially when many employees’ work can be done remotely. It’s no wonder that, according to Gartner, almost half of employees will continue to work remotely post COVID-19.

Join the Dark Side with Enterprise Alert

In recent times dark viewing mode for websites has gained a lot of popularity from users worldwide. This does not just apply to your favorites sites, but also for those applications that you rely on day in and day out. Enterprise Alert is no exception. We have heard your requests and are happy to announce that Enterprise Alert 9 now has a dark mode! In the footer of the Web Portal, there is now a Dark toggle. This theme will instantly change your viewing experience between Classic and Dark.

Martello Powers Microsoft 365 Service Excellence

Enterprise Management Associates (EMA) recently developed a report examining the business case for IT end-to-end observability and control and delved into how digital experience management was at the intersection of Microsoft 365 services and IT. Below you will find some excerpts from their report that detail how Martello solutions are able to use digital experience monitoring to provide Microsoft 365 service excellence to our clients.

MobileIron, Acquired by Ivanti, Named "Best Solution Partner" by Deutsche Telekom

Industry recognition is always something to strive for, but when the recognition comes from a trusted partner then it means even more. At Ivanti, we’re proud to be recognized by longstanding partner, Deutsche Telekom, as a “Best Solution Partner” of the year. This was the first year that the award was handed down and represents the deep commitment to partnership and execution.

Microservices Testing: A Quick Start Guide

A microservices architecture creates an application as a collection of services. Each microservice works independently and uses application programming interfaces (APIs) to communicate with other services. Each microservice has its own data store and is deployed independently. Testing a microservices application requires a strategy that accounts not only for the isolated nature of microservices but also for service dependencies.

What Is SaaS Finance? (Plus 12 Metrics You Should Be Monitoring)

One of the most important things SaaS companies need to think about in order to be successful is financial modeling. To succeed in the increasingly competitive SaaS space, finance teams need to carefully consider a wide variety of KPIs and find ways to effectively manage both present and future cash flows. Cash flows and liquidity are two of the most common challenges faced in this industry — if effectively addressed, your company can position itself for long-term success.

The easiest way to deploy your database - The Future of Qovery - Week #7

During the next four weeks, our team will work to improve the overall experience of Qovery. We gathered all your feedback (thank you to our wonderful community 🙏), and we decided to make significant changes to make Qovery a better place to deploy and manage your apps. This series will reveal all the changes and features you will get in the next major release of Qovery. Let's go!

10 Benefits Of Virtualization In The Data Center

Are you looking for ways to improve your data center performance and resource utilization? Consider employing virtualization. Virtualization offers a cost-effective solution to satisfy the growing need for storage capacities and IT support required by most organizations. It is a process that allows you to scale up your physical resources to meet your increasing demands. You can virtualize physical servers, networking, storage, and other infrastructure components to enhance your data center operations.

Monitoring Ceph health with Prometheus

Monitoring Ceph with Prometheus is straightforward since Ceph already exposes an endpoint with all of its metrics for Prometheus. In this article, we will put it all together to help you start monitoring your Ceph storage cluster and guide you through all the important metrics. Ceph offers a great solution for object-based storage to manage large amounts of data even on economical hardware. Besides, the Ceph Foundation is organized as a direct fund under the Linux Foundation.

April Online Meetup - Hypper: Dependency-aware package management for Kubernetes

Introducing Hypper, a new package manager for Kubernetes designed with cluster administrators in mind. Hypper is built on Helm and charts but makes some different assumptions around multi-tenancy and dependent charts (which can be useful with CRD handling). Where Helm assumes a user could be one of many users running in multi-tenant, Hypper assumes the user is a cluster administrator managing a cluster.

Announcing role based access control for API keys for more control over automation

Today, Gremlin is excited to announce the ability to create an API key that can perform actions with the same set of permissions as your user account. This allows you to automate Gremlin tasks safely and securely.

Optimizing Load Times on Apache Web Server on Digital Ocean With SolarWinds Pingdom

Introduction We all want the fastest application possible for our customers. At the same time, we’re under pressure to continuously add new features. These new features add complexity, which makes our application heavier, which in turn slows our applications down. So, how do we add new features, yet keep the performance of our application high?

The essentials of Windows event logging

One of the most prevalent log sources in many enterprises is Windows Event Logs. Being able to collect and process these logs has a huge impact on the effectiveness of any cybersecurity team. In this multi-part blog series, we will be looking at all things related to Windows Event Logs. We will begin our journey with audit policies and generating event logs, then move through collecting and analysing logs, and finally to building use cases such as detection rules, reports, and more.

DEM and secure connectivity vendor offerings: the fox & the henhouse

A typical service delivery chain starts from the device and runs through the network and all the way through to the application. There are many things that can go wrong along the way! It’s critical to monitor that experience and quickly understand where issues occur, why they occur, and what can be done to remedy them. That’s where employee and/or customer Digital Experience Monitoring (DEM) comes into play.

Monitor applications on GKE Autopilot with the GKE Dashboard

Elite software development teams automate and integrate monitoring observability tools more frequently than lower performing teams, per the Accelerate: State of DevOps report. Organizations that need the highest levels of reliability, security, and scalability for their applications choose Google Kubernetes Engine (GKE). Recently we introduced GKE Autopilot to further simplify Kubernetes operations by automating the management of the cluster infrastructure, control plane, and nodes.

5 typical mistakes in alerting and how to avoid them

A good alerting strategy is an important prerequisite for successful operations management and the availability of mission-critical systems. But also for employee satisfaction. It’s not just about sending out alerts upon critical conditions, problems and failures at all, but more importantly, about how it is done. Here are the 5 most typical mistakes, their consequences and how to avoid them.

Export API v2 - Streamline Large Log Data Exports

The LogDNA platform improves how teams use logs to help with debugging and troubleshooting. However, having fast access to actionable data isn’t the only value you can get from logs. There’s a lot of additional value in analyzing historical log data to understand long term trends. For example, customers can use log data as a way to represent audit events for user actions and benefit from visualizing them in a 3rd party software.

Choosing Azure Instances for Microsoft WVD: Community and Vendor Resources

In an earlier blog, we had discussed what is Microsoft Windows Virtual Desktop (WVD) and why it is gaining popularity. In this blog, we present various community and vendor resources that can help you choose the right Azure instances for your Microsoft WVD deployment. Here, at eG Innovations, we offer a wealth of monitoring and simulation tools to allow you to monitor what real users are experiencing when accessing Microsoft WVD.

Comparison: Snyk, Aqua Security, Sysdig

Security testing tools help us to monitor our cloud-native resources for potential vulnerabilities throughout our development lifecycle. By flagging security vulnerabilities early, our teams can react on time to reduce potential damage to our end-users and our business. This post will compare three different Security scanning tools that focus on cloud-native infrastructure and application security, namely.

Webinar: How Medtronic Tripled Serverless Development Velocity

In this webinar, experts from Medtronic and Lumigo review the architecture and monitoring setup of Medtronic's AWS serverless environment, which processes more than a billion Lambda requests a month. They will show real-world examples of how the Medtronic serverless dev team quickly finds areas for improvement and acts on them.

Collision 2021 - Securing Software Pipelines with Continuous Packaging

Building automation and security into software supply chains requires packaging source code, dependencies and containers into logical, versioned units. But, in 2021, how engineers package their software is more vital than ever, requiring a serious refresh with a name befitting its focus on security within the cloud. We call it Continuous Packaging. Now on-demand, watch our talk from Collision Conference 2021 where we explored Continuous Packaging and how it can help secure your delivery pipelines, from development through to deployment.

How to Plan a Threat Hunt: Using Log Analytics to Manage Data in Depth

Security analysts have long been challenged to keep up with growing volumes of increasingly sophisticated cyberattacks, but their struggles have recently grown more acute. Only 46% of security operations leaders are satisfied with their team’s ability to detect threats, and 82% of decision-makers report that their responses to threats are mostly or completely reactive — a shortcoming they’d like to overcome.

Q&A With Forrester Senior Analyst Rich Lane

I recently had the honor of moderating a webinar featuring Forrester Senior Analyst Rich Lane and Steve Breen, Head of Managed Services at ANS, titled “AIOps for the Modern Enterprise: Real-World Advice & Implementation Tips from the Pros.” In this informative session, Rich and Steve talked about the importance of building AI and automation into business strategy and provided tips, tricks, and real-life examples of how modern organizations are using AIOps to drive positive business outcomes

We've added first-class Windows support to Grafana Agent

The Grafana Agent team is happy to announce that Grafana Agent 0.14.0-rc2 includes improved Windows support. Up until now, running Grafana Agent — our tool for gathering metrics, logs, and traces — in Windows was difficult and not well supported for Windows best practices. In short, it was not a good Windows citizen. In the new release candidate, we’re making changes to improve the experience, based on feedback from GitHub issues, customer contacts, and our own experience.