January 2022

Top 5 challenges in Ethernet monitoring and how to simplify them with OpManager

An Ethernet connection helps businesses with critical communication, and even a slight interruption can irritate users or result in costly downtime. On top of this, the larger the network, the more complex the Ethernet network becomes.

Be in charge of your cloud costs with these 2021 releases: CloudSpend recap

The widespread adoption of digital transformation triggered by the global health crisis has created a boom. For some businesses, the transition was smooth, but for others, it was an aggressive shift. Out of all the challenges posed by the transition to digital environments, messy cloud cost management and rocketing cloud bills are the most taxing. According to Gartner, through 2024, 60% of infrastructure and operations leaders will bear cloud costs that hurt their on-premises budgets.

Struggling with blurry website imagery? You're not alone. Here's how to optimize for better image clarity across different browsers.

When it comes to your website, visual content plays a huge role. In a world where it takes our brains only 13 milliseconds to process an image, visuals help narrate your brand story in a quick and visually captivating way. This ability to process images so quickly places even more importance on the need for crisp, high-quality content. A website tainted with fuzzy and blurry images can affect engagement and lead to an overall negative experience for visitors.

Improvements Made To AppSignal Node.js Integration In 2022

During the last few months, we've been working hard on improving our Node.js integration. We've released loads of quality fixes and improvements to our diagnose command, configuration, and general package structure. Today, we'd like to highlight some of the enhancements and fixes that we've recently released.

Adopting Cloud Technology

In the second half of 2021, eG Innovations partnered with the DevOps Institute to conduct an online survey of more than 900 individuals from sysadmin, DevOps, SRE, and other IT backgrounds. You can download the full survey results here: Cloud Technology Adoption Trends | eG Innovations. If surveys and statistics on technology adoption are of interest, we have some other recent ones available, conducted in the last 12 months.

Thundra Foresight: Now Supporting Azure DevOps CI Platform

Are you in DevOps and sick of looking at logs while waiting for your build to finish? Or are you a quality engineer running automated tests in the CI pipeline and wondering why your tests failed? Perhaps you’re a developer trying to debug a failed application in the test environment.

Ocean explained: Ocean controller deepdive

As a managed data plane service for containerized applications, Spot Ocean provides a serverless experience for running containers in the cloud. Ocean integrates with the control plane of your choice, and handles key areas of infrastructure management, from provisioning compute and autoscaling, to pricing optimization and right-sizing. A core component of Ocean’s architecture is the Ocean controller, which is how Ocean and your Kubernetes cluster integrate and interact.

Cluster Roll Feature Enhancements Now Available

Spot by NetApp’s Ocean includes a powerful feature called “cluster roll.” This feature simplifies applying changes to Kubernetes worker nodes. Typical changes include applying a new image, modifying or adding user data, and updating security groups. A cluster roll applies these changes without having to disable the Ocean autoscaler. It also removes the need for you to manually attach new nodes or remove replaced nodes from the cluster.

"What's in it for us?": Putting Users in the Driver's Seat of VDI w/ VMware

Back in 2007, when the VMware team was outlining the benefits of virtual desktop infrastructure (VDI), our presentations included a very specific use case: “global pandemic”. No, we didn’t have a crystal ball through which we could foresee the COVID crisis more than a decade in advance. But even back then, we were looking at the security benefits of VDI if a global health crisis did suddenly force workforces to go remote.

Using Configuration Management to Detect Unwanted Software

The Log4j vulnerability is the latest cyber exploit, bringing a CVSS critical score of 10. It allows attackers to execute arbitrary Java code on remote computers, including accessing sensitive information. Only a year since the world addressed the SolarWinds supply chain attack, it’s another confirmation that network professionals must adopt long-term risk-management strategies. Are Opmantek products affected?

The Big Takeaways from Cyber 5 2021

If you did your holiday shopping online this year, you’re not alone. Cyber 5, the five days between Thanksgiving and Cyber Monday, represented one-fifth of all eCommerce sales for November and December in 2021 (despite a slight decline in overall spending since last year). Americans shelled out $8.9 billion on Black Friday deals and $10.7 billion on Cyber Monday specials.

7 marketing trends MSPs should be watching for in 2022

If you haven’t started planning your marketing activities for 2022, it should be your number one priority this month. But what should you be doing and where do you start? To kick off 2022, I thought a post on the top marketing trends MSPs should watch for in 2022 would be a great springboard to get those creative juices flowing.

What Should A Board Know About Tech In 2022? | Splunk & Accenture

There is so much happening in the technology space, let alone in each individual global market. How does an organisation keep up? What trends does it need to keep an eye on, and which ones does it need to invest in? We will discuss some of these issues today. Join Brian Berg, Principal Director at Accenture, Blanca Galletero, Splunk’s GVP EMEA GTM Ecosystem, and Mark Woods, Chief Technical Advisor EMEA at Splunk, as they discuss the topic ‘What should a board know about Tech in 2022?’.

Major New Features and Enhancements Now Available for Ivanti Workspace Control

We have just released Workspace Control 2022.1 (10.8.0.0), which contains three major new features and several bug fixes. We’ve also included several workflow improvements based on User Voice requests which were submitted via the Product Ideas page on the Ivanti Community.

Tip: DNS Speed Performance Test

Have you checked your DNS speeds lately? When it comes to online performance, speed is king! The faster your service resolves queries, the quicker end users get to their destination and the happier your customers will be. Here’s the thing: When it comes to DNS speed, every millisecond over your competitors’ speeds could mean a lost customer. The good news is, there’s a way to test your speeds and to see just how your DNS speeds stack up against the competition.
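As a rough illustration of what "testing your DNS speed" means in practice, the sketch below times repeated hostname lookups using only Python's standard library. It is not the tool the post refers to; the function names `time_dns_lookups` and `summarize` are hypothetical, and operating-system or resolver caching can make repeat lookups look faster than a cold query would be.

```python
import socket
import statistics
import time
from typing import Callable, Dict, List


def time_dns_lookups(
    hostname: str,
    attempts: int = 5,
    resolver: Callable[[str], object] = lambda host: socket.getaddrinfo(host, None),
    clock: Callable[[], float] = time.perf_counter,
) -> List[float]:
    """Return the duration of each lookup in milliseconds.

    `resolver` and `clock` are injectable so the timing logic can be
    exercised without touching the network (an assumption of this sketch).
    """
    durations_ms = []
    for _ in range(attempts):
        start = clock()
        resolver(hostname)  # result is discarded; only the elapsed time matters
        durations_ms.append((clock() - start) * 1000.0)
    return durations_ms


def summarize(durations_ms: List[float]) -> Dict[str, float]:
    """Reduce a list of lookup times to min, median, and max."""
    return {
        "min_ms": min(durations_ms),
        "median_ms": statistics.median(durations_ms),
        "max_ms": max(durations_ms),
    }
```

Calling `summarize(time_dns_lookups("example.com"))` from a machine with network access would give a small summary to compare across resolvers or providers; the numbers depend entirely on where the test is run from.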

Run Datadog Synthetic tests in your Jenkins pipelines

Continuous integration (CI) has become the mainstream approach to software development as it enables organizations to iterate quickly while minimizing the risk of releasing faulty code. To implement CI, many organizations rely on Jenkins—one of the most mature and widely used automation servers on the market. Jenkins comes with hundreds of community-backed plugins to help you easily integrate it with other tools in your development workflow.

Sponsored Post

How to Troubleshoot Your Network Using Traceroute

Anyone who has investigated the cause behind a network issue knows how daunting it can be to pinpoint the problem. With so many different variables, the answer can be elusive. In our last blog, we explained how you could test your DNS speeds in PerfOps. Today, we’ll be showing you how to use our Traceroute tool to troubleshoot network issues and improve DNS speeds.

Data Mining Methods: The Top Five

Knowing what business problem you want to address will help you know which data mining technique will produce the best results. Each of the data mining strategies listed below addresses a different business challenge and yields a different result. We are surrounded by big data in today’s digital world, which is expected to rise at a rate of 40% per year over the next decade. The irony is that we are awash in data yet deficient in knowledge. Why?

Top 5 Tools to Test Your Website in 2022

The reliability of a website affects its earning potential, as every second in the digital world counts. According to a study by BCG and Ryte, every second of loading speed costs from $3,000 to $9,000, depending on the eCommerce industry. That shows that your website has to perform optimally all year round. It's the only way to avoid losing money. Aside from outstanding performance, you need to work on your website's design. Certain tools help you improve it to get more conversions.

Feature Spotlight: Centralized Log Collection

Speedscale is proud to announce its Centralized Log Collection capability. When diagnosing the source of problems in your API, more information is better. For most engineers, the diagnosis process usually starts with the application logs. Unfortunately, logs are usually either discarded or stored in Observability systems that engineers don’t have direct access to. Compounding this issue is that the log information is typically not correlated to what calls were made against the API.

Who is Most Vulnerable to Ransomware Attacks? New Report Reveals Latest Trends.

No one will be surprised to hear that ransomware is, once again, on the rise. The last two years have seen a stratospheric increase in both the frequency and sophistication of attacks. In a just-released report from Ivanti, Cyber Security Works and Cyware, 2021 closed out with alarming statistics including a 29% increase in CVEs associated with ransomware, and a 26% increase in ransomware families compared to the previous year.

7 AWS Migration Strategies That Can Help Prevent Overspending

Many companies don’t know where to begin when migrating to AWS. Some worry their data will leak, while others don't know the most efficient migration strategy for AWS. Another group worries about overspending when moving to AWS. The migration strategy involved plays a crucial part in all of these concerns. This guide will discuss more than just AWS cloud migration strategies.

Puppet's new Cloud Migration Service helps migrate your PE installation

Adopting a public cloud platform like AWS has many benefits, but the process of moving your existing automation capabilities between on-prem and the cloud can present challenges and make it difficult to take full advantage of cloud. In fact, in a recent survey conducted by Puppet, we learned that many Puppet users are significantly influencing their organizations’ cloud migration planning, indicating that Puppet can play a key role in cloud migration.

Prevent Data Downtime with Anomaly Detection

A couple months ago, a Splunk admin told us about a bad experience with data downtime. Every morning, the first thing she would do is check that her company’s data pipelines didn’t break overnight. She would log into her Splunk dashboard and then run an SPL query to get last night’s ingest volume for their main Splunk index. This was to make sure nothing looked out of the ordinary.
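The manual morning check described above can be approximated with a simple statistical rule. The sketch below is an assumption for illustration, not the method the post goes on to describe: it flags last night's ingest volume when it sits more than a chosen number of standard deviations from recent history. The function name, the gigabyte units, and the default threshold of 3 are all hypothetical.

```python
import statistics
from typing import Sequence


def is_ingest_anomalous(
    history_gb: Sequence[float],
    latest_gb: float,
    threshold: float = 3.0,
) -> bool:
    """Flag `latest_gb` if its z-score against `history_gb` exceeds `threshold`."""
    if len(history_gb) < 2:
        raise ValueError("need at least two historical data points")
    mean = statistics.fmean(history_gb)
    stdev = statistics.stdev(history_gb)  # sample standard deviation
    if stdev == 0:
        # Perfectly flat history: any deviation at all counts as unusual.
        return latest_gb != mean
    return abs(latest_gb - mean) / stdev > threshold
```

A z-score check like this is easy to reason about but ignores weekly seasonality, so a quiet weekend could be flagged unless the history is restricted to comparable days.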

What's Next for AIOps? 4 Trends for the Future of AIOps

As an idea conceived by Gartner four years ago, AIOps is already a mature practice. But it is also one that continues to evolve as businesses turn to AIOps to support new use cases, and as AIOps vendors build better and more efficient AIOps tools. That raises the questions: what’s next for AIOps? What are the relevant trends that will shape the future of AIOps over the next several years, and how will AIOps use cases evolve going forward?

Announcing our newest integration: Confluence

Using FireHydrant’s Runbooks, incident and retro data can be automatically sent to Confluence at any point in the incident lifecycle. For example, the moment you’ve resolved an incident FireHydrant can create a fresh Confluence page with all of the critical incident information stored in FireHydrant. When utilizing Runbook conditions, you can choose the perfect moment to send your FireHydrant retro to a Confluence workspace.

What Your System Outage Notifications Need To Say

System outages happen to the best of us. Communicating with your customers and other stakeholders effectively during downtimes is vital to maintaining a solid relationship with them. When a system outage occurs, technical teams are tasked with swiftly locating the cause and resolving the issue, while communications teams are tasked with notifying stakeholders and customers about the outage to maintain transparency.

Is Shadow IT Impacting Your Security? How An Organization Restored 90% Compliance in One Day

Just how effective can an employee engagement campaign be? Consider this: A single Nexthink Engage campaign prompted 90% of employees to update their browser in one day. Despite not having access to the enterprise version of the Google Chrome browser, thousands of employees in this U.S. biopharmaceutical company downloaded the personal version of Google Chrome. 5200 employees to be exact.

How to Really Benefit From Problem and Change Management

In the early days of the pandemic and in 2021, teams were on their heels reacting to everything thrown their way: changes in work dynamics, accelerated digital transformation, and new support procedures and expectations. Providing exceptional support isn’t only about quick response and incident resolution; much of it boils down to processes behind the scenes that keep you proactive. Enter: problem and change management.

Global Online Meetup - January 2022 - SUSE's Cloud Native Security Roadmap 2022

In the January Global Online Meetup we look at our Cloud Native Security Roadmap for 2022. Hosted by Tim Irnich, Director Technical Evangelism, SUSE & Rancher Community. Our Global Online Meetups provide an inside view into the latest and greatest of SUSE technology. Our colleagues from Engineering & Innovation present their newest ideas and projects and are eager to hear feedback from our community.

Broadcom and AppNeta Deliver Industry-Leading Network Monitoring and End-User Experience Monitoring

On December 7, Broadcom announced its intent to acquire privately held AppNeta Inc. headquartered in Boston, MA. AppNeta’s SaaS-based solutions provide enterprise IT teams with precise, end-to-end visibility into network performance from the end-user’s point of view. Combined with DX NetOps by Broadcom Software, AppNeta monitoring capabilities will help enterprises and service providers to more efficiently diagnose and improve network performance for end-users, independent of what network they use to access applications. For more info, visit broadcom.com/netops

Rancher Desktop 1.0.0 Has Arrived

We are happy to announce the 1.0.0 release of Rancher Desktop. This release has been months in the making since development on Rancher Desktop began. After starting small and learning what users needed, we were able to adjust its path and develop the features needed for a 1.0.0 stable community release. But wait – what is Rancher Desktop again? It’s an open source app for desktop Kubernetes and container management on Mac, Windows and Linux.

Harnessing the power of the cloud to create personalized experiences

Technology teams are under more pressure than ever before. They’re balancing the demands of a changing workplace, growing customer expectations, and shifting from traditional to digital delivery. While managing more applications with less visibility, they face expectations to deliver fast, customer-grade experiences. These digital experiences are increasingly enabled by the cloud.

DevOps Methodology | Goals, Principles & Process

Wondering what DevOps Methodology is all about? We will explain what it is, how it works, and the principles and processes that make it successful. What is DevOps Methodology? DevOps methodology is a development process where Development and IT Operations collaborate throughout the lifecycle to facilitate faster deployment of reliable software products.

MDM vs RMM

For decades, IT teams have relied on on-premises, network-centric approaches to IT management. As these teams have grown to manage more and more cloud- and SaaS-based products for the business users, they have staunchly persisted in their use of legacy management tools. Even the growing mobile, field, and remote workforce has only marginally impacted IT management methods, with most simply finding workarounds to their network-centric systems that leave end-users frustrated and unsatisfied.

Adapting to the Digital Future: IT Efficiency in 2021

In the last year, the dual pressures of accelerating digital transformation and delivering seamless remote experiences have challenged IT teams to fundamentally adapt. Facing constrained budgets, threats from would-be cybercriminals, and a surge in new remote devices, their role has grown both increasingly critical and increasingly complex. NinjaOne's latest report looks at the top challenges and trends impacting IT efficiency. Read the report to learn how manual processes could be costing IT teams time and money and exposing them to new risk.

Raygun Alerting: Monitor your latest deployment

Modern development teams are shipping code faster than ever before. Having visibility into the issues that will inevitably get introduced into your software is crucial for the development process. Latest deployments for Raygun Alerting helps with just that. Now, you can tick the latest deployment checkbox on all Raygun alert types to only monitor your latest deployment and resolve issues before your customers ever even notice.

4 reasons why network visualization is integral to successful network management

Businesses in today’s world use networks for almost all their operations. As businesses grow and expand with time, so do their needs. As a result, their networks can become increasingly complex and sophisticated. This can result in network administrators having a harder time monitoring devices and identifying faults. These bottlenecks can be circumvented with network visualization.

Smart, agile MLOps on any cloud - Canonical releases Charmed Kubeflow 1.4

Today, the Canonical Data Platform team announced the release of Charmed Kubeflow 1.4 - the state-of-the-art MLOps platform. The new release enables data science teams to securely collaborate on AI/ML innovation on any cloud, from concept to production.

Why Your Business Needs To Embrace Automation

Automation is a word that can cause some discomfort in the business world. There have been concerns that automation will lead to job losses and distress workers, but actually, automation can be used to help employees, and the two can co-exist in the workplace in peace. The technology has come on leaps and bounds in recent times, and now there are many different tasks and processes that can be automated with online tools, software, or equipment. Automation can bring a wide range of benefits to your business, so keep reading to discover why this should be something that is embraced.

Five Ways Developers Can Help SREs

Reliability is a team game. The more developers and SREs collaborate, the greater the success of the product. In this blog, we list five best practices that developers can adopt to make an SRE's life easier. It is not easy to be a site reliability engineer. Monitoring system infrastructure and aligning it with the key reliability metrics is quite a daunting task, whereas a software engineer's job is to deliver high-quality software.

What is Observability? Benefits, Use Cases & More

The year is over, and ‘observability’ was one of the buzzwords that held everyone's attention throughout it, for good reason. Organizations do not want to leave any stone unturned in maintaining performance and offering robust services, whether through ‘monitoring’ practices or ‘observability’, ‘telemetry’, and visibility capabilities. So let’s get into the meaning of each term and understand how they are vital for business growth.

How to Speed up Builds in Your CircleCI CI Environment

In contemporary software creation, we want to work in an isolated environment. With so many dependencies needed to make a product work—many of them dependent on how other dependencies are configured—this makes total sense. An isolated environment can help protect the system at large while we solve these problems, hence the growing popularity of virtual environments and containers.

Evolve to a Risk-Based Vulnerability Remediation Strategy with a Cloud-Native Patch Management Solution - Now Available from Ivanti

Ransomware attacks are increasing in frequency and severity every year. The impact to companies is devastating. These attacks typically lead to lost business for companies as they often cause increased customer turnover, system downtime, diminished reputation and other adverse side effects.

Why DevOps needs a change from a 'me' to a 'we' mindset

Chris Yates is a Senior Vice President, Managing Director at Republic Bank, one of the most innovative and forward looking banks in the US. For the last five years and more, he and the bank have been working to introduce DevOps to their database development process. Along the way he has had the opportunity to be joined on his DevOps journey by several of his colleagues who have played an instrumental part in enabling cross collaboration among the development teams.

Introducing CommsFlow for Context-Rich and Timely Updates to All Stakeholders

We’re so excited to announce our latest platform feature, CommsFlow™! This addition to the core Blameless product offering allows teams to keep stakeholders updated as the reliability of services and applications change. With our new automated and customizable communication flows, on-call, engineering, and business teams feel a sense of accomplishment and, of course, stay informed.

The Five Tenets of Observability

A new year is a chance for a new start, and a great opportunity to think about the monitoring and observability platform you’re using for your applications. If you’ve been using a legacy monitoring system, you’ve probably heard about observability all over the ‘net and want to figure out if this is really something you need to care about.

Algist Bruggeman Uses Insights from InfluxDB to Optimize Industrial Processes and Production

Founded in 1884 and located in Ghent, Belgium, Algist Bruggeman supplies fresh, liquid, and dried yeast to industrial, semi-artisanal, and artisanal bakeries, as well as to the beer, wine, and pharma industries. Algist Bruggeman is part of the Lesaffre Group, a key global player in fermentation for more than a century. Even with more than a century of industrial production behind it, Algist Bruggeman continues to evolve its manufacturing processes.

Get Paid to Write About Mattermost Playbooks

Mattermost Playbooks help software engineering teams orchestrate their work across all tools and teams to plan projects and hit milestones by uniting your tech stack through a single point of collaboration. We want to see how our community is leveraging Playbooks in their own tech stack and share your creations with everyone so the whole community benefits. We’re doing this by launching a new effort to commission original blog articles that show Playbooks in action.

Go from reactive to proactive IT operations with AI

On your journey from reactive to proactive IT operations, you may be unsure where to start. If your organization is lacking visibility into your operations, that’s the best place to begin. Automation and artificial intelligence (AI) can help you gain that needed visibility. AIOps and visibility: Artificial intelligence for IT operations (AIOps) offers numerous benefits, including faster response time, improved IT health, and simpler IT management.

Grafana Tempo 1.3 released: backend datastore search, auto-forget compactors, and more!

Grafana Tempo 1.3 has been released! We are proud to add the capability to search the backend datastore. This feature will also appear soon in Grafana Cloud Traces. If you want to dig through the nitty-gritty details, you can always check out the v1.3 changelog. If that’s too much, this post will cover the big ticket items. You can also register for our upcoming webinar “Distributed tracing in Grafana: From Tempo OSS to Enterprise” on Jan.

Data Center Skills in High Demand in 2022

As modern data centers grow increasingly complex and distributed, data center technology and infrastructure can become more difficult to manage, and it becomes harder for IT decision-makers to find qualified candidates. According to Uptime Institute, 50% of data center owners and operators are struggling to find qualified candidates. In recent years, the COVID-19 pandemic has accelerated industry trends such as remote data center management, deepening the already severe data center skills gap.

The Business Case for Docker Adoption

Container technology is considered one of the most rapidly evolving in the software industry's recent history. There has been a seismic shift towards more and more organizations adopting containerization for their applications. Containers offer a lightweight, portable, and more efficient alternative to virtual machines and help us run software securely and reliably across different server environments.

Our top prediction for MSP M&A: It's a platform grab right now

The managed services provider (MSP) industry has been rapidly consolidating for the past several years as private equity (PE) firms buy up small IT firms and cobble them together into larger platforms. But that process is evolving quickly for two main reasons: there aren’t enough sellers of quality assets and PE firms have shortened how long they hold their investments. That means they need to hunt for bigger game—and with a rifle, not a shotgun—before they cash out and exit.

Mind Your Dependencies: Defending against malicious npm packages

Modern software projects are mostly composed of open source code. The question of who really controls this code, and is responsible for detecting and fixing software supply chain security issues, became a significant source of concern after the discovery of the Log4Shell vulnerability.

The Architecture of AIOps: 4 Best Practices

Artificial intelligence for IT operations (AIOps), although the new kid on the DevOps block, is here to stay. It’s the future of DevOps and the future is here with us now. AIOps is just the infusion of artificial intelligence to the practice of DevOps. This practice has shown that it’s of great value to a lot of software development firms. Firms that have already implemented AIOps have recorded a massive boost in overall IT productivity.

5 Ways to Improve Your Application Performance Monitoring

The world and technology keep evolving. Over time, applications with functions ranging from buying and selling online to holding meetings to keeping up with friends and family have progressed. Now, we are able to automate actions that used to be manual or at least perform them in the most efficient way possible. This automation is made possible through the use of our applications. Now imagine one of these applications stops working for just 10 minutes.

Automating Notification & Response with Notification & Collaboration Tools

With the ScienceLogic SL1 platform correlating and contextualizing data to generate actionable events that accurately reflect the issues that need attention, how do you make sure all your engineers and system admins are on the same page?

Datadog Cloud Security Platform

Datadog's Cloud Security Platform—consisting of Cloud SIEM, Posture Management, and Workload Security—delivers real-time threat detection and continuous configuration audits across your applications, hosts, containers, and cloud infrastructure. Datadog derives security insights from your observability data, enabling security and DevOps teams to work together to detect, investigate, and remediate threats.

How managed service provider CTAC onboards customers at lightning speed

As a managed service provider, CTAC provides all kinds of IT services for its clients. Efficient monitoring is crucial for them: that way, they can stay on top of performance issues within their customers' IT environments and, eventually, keep their customers happy. However, because CTAC works with many different clients and platforms (such as Microsoft and SAP), their monitoring is often very siloed, which makes it difficult to get an overall view of the performance and health of their customers' IT services.

Introducing CloudZero Budgets: Improve Cost Predictability And Eliminate Surprises

The best budgets aren't roadblocks. They're guardrails: boundaries for quick, collaborative work. Historically, when finance and engineering teams have discussed cloud cost, they’ve run into an obstacle: They don’t speak the same language. At the end of each month, finance gets an ever-changing cloud bill, and engineering explains that, whatever the total, it represents what they need to do their work. Stalemate.

Tanzu Tuesdays 82: Building Production Ready Container Images at Scale with Cora Iberkleid

Building and maintaining production ready container images is a critical requirement for success with Kubernetes. Developers and organizations alike have put a lot of effort over the past decade into DIY solutions, whether it is working on the “perfect” Dockerfile, or automating the build as part of an existing pipeline. These home-grown solutions certainly address immediate needs. If we take a step back, however, we can see significant gaps that introduce overhead, risk and inefficiency. Luckily, as the Kubernetes ecosystem matures, new solutions become available. We can rethink the problem at hand, set higher goals, and achieve them more easily. In this talk, we’ll explore how Cloud Native Buildpacks—and kpack in particular—can boost your image-building capabilities at scale. We’ll also cover how you can easily use kpack and Knative together with Tanzu Community Edition.

Make the most of your observability data with the Data Volume app

As a DevOps, SecOps, or IT operations manager, you're surrounded by all the technology for the systems running the entire organization. This means legacy infrastructure, multi-cloud environments, services, tools, and applications. All of these components generate data—a huge amount of data—some of which you need to leverage for full-stack observability to ensure those systems supporting the business are running efficiently.

Using Oracle Cloud as a Data Lake Made Simple With Cribl LogStream

All Cloud providers such as AWS, Azure, Google Cloud Platform, and Oracle Cloud offer Object Storage solutions to economically store large volumes of data and retrieve it on demand. It’s far cheaper to store one petabyte of data in object storage than in block storage. As AWS S3 has become the standard, many on-premise storage appliance vendors have incorporated S3 APIs to store and retrieve data. Oracle wisely continued that trend to OCI (Oracle Cloud Infrastructure).

Goliath Technologies Announces Release of Citrix Logon Duration Scorecard

Philadelphia, PA – January 25, 2022 – Goliath Technologies, a leader in end-user experience monitoring and troubleshooting software for hybrid cloud environments, announced today the release of its new Citrix Logon Duration Scorecard, expanding on its industry-leading end-user experience reporting and analytics suite.

10 Microsoft Teams Performance Use Cases for IT Admins

Dependence on Microsoft 365 and Teams has never been greater, and the pressure is on for IT teams to deliver exceptional user experiences - anytime, anywhere. The modern workplace sees users connecting from the office, home and pretty much any place in between. This hybrid work model has a significant impact on IT, the network and the overall quality of service perceived by the users.

Observability Pipelines for Dummies

How do you get the data out of your infrastructure and applications in order to properly observe, monitor, and secure their running states while minimizing overlap, wasted resources, and cost? Many business folks need a broad category of tools in all their environments to solve challenges such as up and down monitoring, metrics, a time series database (TSDB), log analytics, event streaming, security information and event management (SIEM), user behavior analytics (UBA), and data lakes. The answer to these challenges is an observability pipeline.

Designing production-ready AWS serverless applications

Serverless has become an increasingly popular paradigm among organizations looking to modernize their applications as it allows them to increase agility while reducing their operational overhead and costs. But the highly distributed nature of serverless architectures requires developers to rethink their approach to application design and development. AWS-based serverless applications hinge on AWS Lambda functions, which are stateless and ephemeral by design.

Best practices for building serverless applications that follow AWS's Well-Architected Framework

In part 1 of this series, we looked at common design principles and patterns for assembling microservices in serverless environments. But when it comes to building serverless applications, designing your architecture is only part of the challenge. You also have to ensure that each of your individual functions and services is secure, reliable, and highly performant, without incurring enormous costs.

Meta has built an AI supercomputer it says will be world's fastest by end of 2022

Social media conglomerate Meta is the latest tech company to build an “AI supercomputer” — a high-speed computer designed specifically to train machine learning systems. The company says its new AI Research SuperCluster, or RSC, is already among the fastest machines of its type and, when complete in mid-2022, will be the world’s fastest. “Meta has developed what we believe is the world’s fastest AI supercomputer,” said Meta CEO Mark Zuckerberg in a statement.

Achieving Website High Availability

When someone says a website is available, they mean that they can access that website. The application they’re trying to reach is up and working properly. High availability means that the website is up most of the time throughout the year. Companies can even put a percentage on this, striving for 100% availability, but typically getting somewhere a bit less, such as 99.9% or 99.99%.
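To make those percentages concrete, here is a minimal sketch that converts an availability target into the downtime it permits per year. The function name `allowed_downtime_minutes` and the 365-day year are assumptions made for illustration; they do not come from the article.

```python
def allowed_downtime_minutes(availability_percent: float, days: int = 365) -> float:
    """Return the minutes of downtime permitted over `days` at a given availability.

    Assumes a 365-day year by default (an illustrative choice).
    """
    if not 0 <= availability_percent <= 100:
        raise ValueError("availability must be between 0 and 100")
    total_minutes = days * 24 * 60
    return total_minutes * (1 - availability_percent / 100)


if __name__ == "__main__":
    # Compare the two targets mentioned in the text.
    for target in (99.9, 99.99):
        minutes = allowed_downtime_minutes(target)
        print(f"{target}% -> {minutes:.1f} minutes/year ({minutes / 60:.2f} hours)")
```

Under these assumptions, 99.9% leaves a budget of roughly 8.8 hours of downtime per year, while 99.99% leaves only about 53 minutes.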

My First Impressions with SUSE Rancher Kubernetes Projects

I recently started working at SUSE. Before joining SUSE, my Kubernetes experience included vanilla Kubernetes, AKS and EKS but mostly OpenShift and Red Hat Advanced Cluster Management. I worked in technical pre-sales, so I knew about Rancher, K3s and RKE and their key features but I never spent time with them. When I joined SUSE, I started testing Rancher, Rancher Desktop, K3s, k3d and RKE2 and I had a great time with them. First things first, I will

19 Questions To Ask Your Cloud Cost Management Vendor

Not all cloud cost management tools are equal. Whether you’re in the process of evaluating cloud cost management vendors or already have a tool in place, here are 19 questions you should ask to ensure you have all the capabilities needed to maximize performance and minimize cost of your hybrid cloud deployment across the following.

StatusIQ: A roundup of our journey in 2021

In between online meetings and chat conversations, we've all embraced the digital way of life and work, and it is here to stay. We may not know any other way to operate businesses in a few years' time except the digital space. This way of life will require clear communication channels for businesses to connect with their users. Keeping that in mind, as well as your feedback in our community, we've shaped StatusIQ to help ease the incident communication process.

Press Release: Kubernetes Management Pack Announcement

Today OpsLogix announces the upcoming release of their new Kubernetes Management Pack. This product is designed to help organizations monitor their Kubernetes clusters using System Center Operations Manager (SCOM). The management pack provides comprehensive monitoring of all aspects of your Kubernetes environment, from individual nodes and pods to entire clusters.

How Reliability and Product Teams Collaborate at Booking.com

With more than 1.5M room nights booked per day, Booking.com requires a solid infrastructure that’s constantly monitored. And indeed, Booking.com now has a footprint of 50,000+ physical servers running across four data centers and six additional points of presence. The sheer size of this server fleet makes it viable for Booking.com to have dedicated teams specializing in looking only at the reliability of those servers.

Ask the Product Experts | THWACK Livecast

We were all new once, stumbling in the dark, feeling our way along the walls looking for the light. Searching blindly isn’t a good way to approach any technology project, so hearing from the experts is one of the better ways to expand your knowledge. It doesn’t matter if you’re new to monitoring or a long-time SolarWinds professional; we’re sure there are questions you’ve got for our seasoned veterans.

Shipa Common Abstraction Layer - AaC - Application as Code

In this Shipa Shorts video, learn about having a common layer for your applications, e.g., Application as Code. The only constant in technology is change. As your CI/CD and IaC stacks change, Shipa's abstraction stays the same for your development team. This example goes through Argo CD, Terraform, and GitHub Actions, all producing the same result.

Argo CD 101 and Setup for Shipa

There is a lot of the art of the possible between the GitOps Engine, Argo CD, and the Application-as-Code platform, Shipa. In a recent blog post, we outlined the power of a one-line developer experience. If you are unfamiliar with Argo CD, though, here is a guide to get you started with Argo CD and leveraging Shipa for your first deployment.

Continuous Service Virtualization, Part 2: Steps for Optimizing DevOps

In my prior blog, Continuous Service Virtualization, Part 1: Introduction and Best Practices, we offered an introduction to continuous service virtualization (SV) and discussed some key best practices. In this, the second and final post in the series, we will discuss the continuous SV lifecycle and how it helps to optimize DevOps and the continuous integration/continuous delivery (CI/CD) pipeline.

Continuous Service Virtualization, Part 1: Introduction and Best Practices

Service virtualization (SV) has evolved as a popular technique and technology over the last decade. Traditionally, SV has primarily been used by testers to simulate other application components that the application under test interacts with. Typically, virtual services have been created and maintained by center of excellence (COE) teams.

Leveraging AIOps to Enable Greater Customer Experiences

As time progresses and competition grows, being “good enough” means that you may be falling behind. Engineers will discover new ways to solve problems, which will enable rapid increases in availability and scalability. With these increases comes more complexity and the generation of more data. Rather than just monitoring the new data and letting the old data sit there collecting dust, you should consider using it to gain maximum insights into your environment.

Event Orchestration Demo: Reduce Noise & Manage Event Routing with PagerDuty

Say hello to the next generation of event rules and cut down on manual event processing. With Event Orchestration, you can create custom logic with nested rules to enrich, modify, and control routing or trigger automation actions based on event conditions at scale. (This feature is only available to Event Intelligence and Digital Operations plans).

AWS Re:Invent 2021 - Accelerate Your Cloud Migration for Financial Services

Cloud migration and modernization projects for financial services are very complex initiatives with added challenges of visibility and incident response. Here’s how we can help accelerate cloud adoption while reducing customer impact and streamlining and automating incident response.

A (de)bug's life: Diagnosing and fixing performance issues in Grafana Loki's read path

Beep, beep, beeeeeeeep. Read path SLO page, again. And I’ve almost found the noisy neighbor! That was me. And will probably be me again at some point in the future. As we continue to scale up the team that builds and runs Grafana Loki at Grafana Labs, I’ve decided to record how I find and diagnose problems in Loki.

Start Logging Everything: Humio Community Edition Series

In this blog, we’ll show you, step by step, how to download stock data and then upload it to Humio. You can then search that data and build a dashboard for fast insights. Subsequent blog posts will expand on this dashboard and show you how to move from analyzing historical data to live data. To get started, you’ll need to access Humio Community Edition, which is available at no cost.

4 benefits of a connected workforce in manufacturing

The manufacturing skills gap is projected to leave more than 2 million jobs unfilled by 2030, costing the US economy as much as $1 trillion, according to a report by Deloitte and The Manufacturing Institute. When COVID-19 hit, about 1.4 million people lost manufacturing jobs, according to the report. Although the industry has hired back many workers, hundreds of thousands of positions remain unfilled. On top of layoffs, workers are retiring en masse.

What Is A Business Advisor?

Do you possess valuable insights from your experience leading teams and running operations? You can use these to inspire and motivate leaders - with less experience - as they embark on growing their businesses. A business advisor can play a pivotal role in positively impacting the trajectory of a business by using their experience to provide informed opinions and offer advice.

What Is a Distributed System?

Before answering this question, I like to step back and take a good look at the history of computing. Mainframes and legacy client-server applications were monolithic: all the processing took place on a single set of hardware. As hardware grew cheaper, and especially after hardware virtualization became widespread, these architectures gave way to the broad development of distributed systems.

Running regular security scans with scheduled pipelines

Security is a vital part of application development, yet it may be neglected until an attacker takes advantage of a vulnerability in the system. The consequences of a security breach can damage an application’s integrity as well as a company’s reputation and revenue. Software architects and engineers need to pay special attention to securing the systems they work on.

5 IT Documentation Best Practices in 2022

As the IT world continues to grow in size and complexity, an increasing amount of information is needed for both the operation and security of IT devices. It’s nearly impossible for MSPs and IT professionals to remember everything without writing it down somewhere, but “somewhere” needs to be both secure and readily available to reference. Fortunately, IT documentation software was created to be both.

Getting Ready for a smooth, speedy migration to the Splunk Cloud Platform

This video shows you how a little bit of preparation before you kick off your cloud migration can lead to a speedy, smooth ride. Additionally, this video will help you decide on your migration strategy that is best for your environment and show you how to assess the efforts required for migrating your environment to the Splunk Cloud Platform.

Communicating to Users During Incidents

Imagine you're having a regular day at work, opening up your browser, double checking something for a client in that web app your team built for them, when suddenly, you see this screen: You hit refresh a few times, just to be sure. Nope. Still down. What happens next depends on how well your team has planned for incidents like this (some folks call it unplanned downtime).

The 'Decade of IoT' is off and running

If 2021 was the soft launch of the Decade of the Internet of Things (IoT), 2022 is set to accelerate IoT-related technologies and investments, addressing societal and economic issues. The rollout of 5G, maturing of artificial intelligence algorithms for streaming IoT data, increased computing power at the edge and cheaper/better sensor technology is the “convergence” that has supercharged IoT adoption.

The importance of SemVer for your applications

For some developers, SemVer can look purely cosmetic, nice to have, or simply useless. But following the SemVer format is essential to building reliable software. I'll explain how, over one year, we encountered two issues related to SemVer. The first was critical and led to a production outage, while the second caused a lot of trouble for several companies upgrading a managed service.
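As a rough illustration of why the format matters, the sketch below parses the MAJOR.MINOR.PATCH core of a version and compares versions numerically instead of as strings. The helper names `parse_core` and `is_breaking_upgrade` are hypothetical, and the sketch deliberately ignores pre-release and build-metadata suffixes; it is not taken from the article.

```python
import re
from typing import Tuple

# Core SemVer only: three dot-separated numeric fields without leading zeros.
# Pre-release tags ("-rc.1") and build metadata ("+build5") are out of scope here.
_CORE = re.compile(r"(0|[1-9]\d*)\.(0|[1-9]\d*)\.(0|[1-9]\d*)")


def parse_core(version: str) -> Tuple[int, int, int]:
    """Parse 'MAJOR.MINOR.PATCH' into a tuple of integers."""
    match = _CORE.fullmatch(version)
    if match is None:
        raise ValueError(f"not a MAJOR.MINOR.PATCH version: {version!r}")
    major, minor, patch = match.groups()
    return int(major), int(minor), int(patch)


def is_breaking_upgrade(current: str, target: str) -> bool:
    """Under SemVer, a higher MAJOR number signals incompatible API changes."""
    return parse_core(target)[0] > parse_core(current)[0]


if __name__ == "__main__":
    # String comparison sorts "1.10.0" before "1.9.0"; numeric tuples do not.
    print("1.10.0" < "1.9.0")
    print(parse_core("1.10.0") > parse_core("1.9.0"))
    print(is_breaking_upgrade("1.9.3", "2.0.0"))
```

Comparing parsed tuples avoids the ordering mistake that plain string comparison makes once any field reaches two digits, and checking the MAJOR field gives an automated upgrade a simple signal for when to stop and ask.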

Improving your team's on-call experience

Your engineers probably dislike going on-call for your services. Some might even dread it. It doesn't have to be this way. With a few changes to how your team runs on-call, and deals with recurring alerts, you might find your team starting to enjoy it (as unimaginable as that sounds). I wrote this article as a follow-up to Getting over on-call anxiety.

New in StatusGator: Reordering Services

As our public status dashboards have become more popular, so has the ability to customize them. Over the next several weeks, we will be rolling out a series of features that allow more customization of your dashboard. Already we’ve added custom CSS capabilities. Today, we’re rolling out service reordering. Our new dashboard management page has a slimmed-down look.

When beginning your digital transformation journey, should you invest in machines or in enabling operators?

The term “Industry 4.0” originated from a committee of German technocrats who wanted to make predictions about where technology was headed next. And certainly, it fits nicely in the storyline of the initial Industrial revolution with the rise of machines powered by steam and water, the second revolution sparked by the use of electricity, and the third revolution of automated production with robots.

Sponsored Post

TOP 10 IT Trends

Observability has gained a lot of momentum in the past year, be it full stack observability or data observability. Modern complex IT systems using clouds, microservices and serverless are easy to develop and deploy but extremely difficult to observe. These systems generate tremendous amounts of data and need an automated way of handling the volume. The next era of delivering customer experience is underpinned by the full stack observability capability.

What are CDN Logs and Why Do They Matter

A content delivery network (CDN) produces numerous log files, called CDN logs, as it delivers video across the internet to our homes and mobile devices. These logs contain crucial information about the CDN servers' performance and video streaming quality. They also amount to terabytes of data, which brings its own hurdles in handling them in real time and performing analytics to understand user experience and network concerns.

Deploy ImageLabeller with Bitbucket

Follow along with this step-by-step video series as Warren Marusiak, a senior technical evangelist, demonstrates pushing a code change to production using Bitbucket Pipelines CI/CD. To demonstrate how to develop, deploy, and manage applications using Jira Software and various connected tools, our team created ImageLabeller, a simple demo application built on AWS that uses machine learning to apply labels to images.

Deploy ImageLabeller with GitHub

Follow along with this step-by-step video series as Warren Marusiak, a senior technical evangelist, demonstrates pushing a code change to production using Bitbucket Pipelines CI/CD. To demonstrate how to develop, deploy, and manage applications using Jira Software and various connected tools, our team created ImageLabeller, a simple demo application built on AWS that uses machine learning to apply labels to images.

Deploy ImageLabeller with GitLab

Follow along with this step-by-step video series as Warren Marusiak, a senior technical evangelist, demonstrates pushing a code change to production using Bitbucket Pipelines CI/CD. To demonstrate how to develop, deploy, and manage applications using Jira Software and various connected tools, our team created ImageLabeller, a simple demo application built on AWS that uses machine learning to apply labels to images.

Dashboard Fridays: Sample Microsoft Teams Dashboard

Join SquaredUp's Adam Kinniburgh and Purdue University's Daniel Parrott as they showcase this sample Microsoft Teams dashboard used by Purdue to visualize key Microsoft Teams usage metrics for their online classrooms. Built using SquaredUp, this dashboard keeps track of the total number of Teams and which are empty, allowing them to pinpoint issues with the data load to create the teams or update issues. The dashboard also monitors for empty class sections to help identify issues with the class selection process.

Practical Tips & Tricks for Speeding Up Your CI/CD Pipelines

When developing software and maintaining CI/CD and testing pipelines, we are often compelled to increase our test coverage by adding more tests, and therefore improve our apps’ quality. After all, more automation equals better software, right? There’s a flipside to this equation, however, and a point at which we start seeing diminishing returns from each test we add. Taken to the extreme, these diminishing returns begin to actively harm our ability to deliver working software.

Delightful service management turns eight

Eight years ago, we were inspired to create an employee experience on par with the best customer experience available anywhere. Fast forward to today, we are the only Challenger in the Gartner Magic Quadrant for ITSM tools, enabling customers to delight their employees with a modern and right-sized service management solution.

Top 10+ Best System Monitoring Software & Tools [2022 Comparison]

It’s virtually impossible to manage today’s complex IT environments at scale without a comprehensive system monitoring solution that allows you to check the health of all your applications and services from a single pane of glass. When your end users are experiencing difficulties, you must have such a tool in place that lets you quickly ascertain and remediate the root cause of the slowdown or error.

Continuous Software Pipelines: Why Enterprises Are Going Cloud-Native 2021 Dev Week Cloud Keynote

Why are enterprise organizations making a move from on-premise solutions to completely cloud-native? What does that mean for improving, scaling, and securing their CI/CD pipelines? And what exactly is continuous packaging, anyway? Join Dan McKinney in this Dev Week Cloud session as he answers all of these questions, helping attendees understand the true difference between cloud-hosted and cloud-native, how to get started with migrating to a cloud-native solution, and the true benefits of being entirely within the cloud.

Cloud-Native Pipelines: Secure Software Delivery, Made Simple Dev Week Cloud Workshop Session

Your entire tech stack is likely in the Cloud - so why aren’t your software packages? Whether you’re currently on-premise, have your own in-house solution or have a bit of a hybrid set up, join us in this session to explore why the future is cloud-native, what the benefits of this are over cloud-hosted, and how to easily set up a secure, cloud-native software pipeline in 60 seconds.

"Build It Yourself, They Said. It Will Be Worth It, They Said" Dev Week Enterprise Keynote Session

“We’ll build it ourselves!” We’ve all heard it, seen it, and likely been directly impacted by the decision to build a custom, in-house solution rather than use an existing one. Whether it’s a CI/CD tool, artifact management solution, or even the entire DevOps tech stack, it’s a common misconception that building it internally is easier, cheaper, and faster. When, in fact, the complete opposite is true!

Continuous Software Pipelines: Why Enterprises Are Going Cloud-Native Dev Week Enterprise Open Talk

Your entire tech stack is likely in the Cloud - so why aren’t your software packages? Whether you’re currently on-premise, have your own in-house solution or have a bit of a hybrid set up, join us in this session to explore:
- Why enterprise organizations are making the move from on-premise solutions to completely Cloud-Native ones
- What this means for improving, scaling, and securing their CI/CD pipelines
- What the benefits of this are over cloud-hosted
- How to easily set up a secure, cloud-native software pipeline in 60 seconds.

Package Management for Gaming Software Development

There is huge scope required when building video games. They are not just computer programs; they’re audio-visual artistic works. It’s a collaborative effort between software engineers, animators, scriptwriters, graphic designers, photographers and sound engineers. Working with these collaborators and assets leads to a different software pipeline than the average software project.

How to Develop and Deploy AI/ML Workloads at Scale - Prototype to Production in Days, not Months

Explore how organizations can develop and deploy machine learning (ML) workloads at scale on top of Kubernetes in NVIDIA DGX systems, while satisfying the organization’s security and compliance requirements, thus minimizing operational friction and meeting the needs of all the different teams involved in a successful ML effort.

Why Is Equipment Calibration Management Important for Manufacturers?

Calibration is a crucial factor especially for organizations related to manufacturing, food products, or packed food. Calibration is one of the least known yet one of the most crucial factors for several major industries. Calibration management enables you to get more control over the procedure. It also assists with better equipment management! In this blog, we will cover all major points related to calibration management! So, let us begin!

Getting over on-call anxiety

You've joined a company, or worked there a little while, and you've just now realised that you'll have to do on-call. You feel like you don't know much about how everything fits together. How are you supposed to fix it at 2am when you get paged? So you're a little nervous. Understandable. Here are a few tips to help you become less nervous.

Don't Fear the Automation

Automation is often portrayed as a scary thing. Whether it’s artificially intelligent robots conquering the planet or a world where no one can find work because automatons have taken everyone’s jobs, movies and entertainment often paint automation with a horror brush. But IT pros know this isn’t the reality. They know automation is a tool capable of making their lives easier and helping their organizations run smoothly. Still, many IT pros fear automation for legitimate reasons.

11:11 Systems Completes Acquisition of iland

Combined Offering to Unlock the Power of Connectivity, Cloud and Security. Newly-formed 11:11 answers cloud market's call for a single, trusted vendor to manage and monitor hybrid infrastructure. iland and previous acquisition, Green Cloud Defense, to serve as core ingredient platforms in 11:11 market-leading offerings.

How to Improve Your Digital Workplace

Over the past couple of years, the workplace has gone digital. Yep, that's right: millions of employees now work from home through their computers and other devices. In addition to this, employees who have remained in the office are also highly reliant on digital tools. Unless there's a giant earthquake that wipes out the world's internet servers, it's safe to say that the digital workplace is here to stay - and it's something you should be very excited about (especially if you're a business owner).

Datadog NPM now supports Consul networking

Consul is a service networking platform from HashiCorp that helps you manage and secure communication between microservices. You can use Consul with Kubernetes, and it supports on-prem, hybrid, and multi-cloud architectures. Consul service mesh provides a control plane which allows you to automate the management of traffic between your services via features like service discovery, DNS, load balancing, and routing.

Incident Response: Virtual agent capabilities in app video

Virtual agent has two primary functions: to transfer the user to a live agent and to provide case status. In this video we’ll show you how both of those functions work. For more information on Incident Response, see: Your feedback helps us serve you better. Did you find this video helpful? Leave us a comment to tell us why or why not.

How the IT service provider q.beyond thrives with Icinga

We are proud of our many customers and users around the globe who trust Icinga for critical infrastructure monitoring. That's why we're now showcasing some of these enterprises with their success stories. These are stories from companies or organizations just like yours, of any size and from different kinds of industries. Some of them are long-standing customers; others have only recently benefited from migrating from another solution to Icinga.

Our Selenium Synthetic Monitoring Stack

We maintain a highly optimised browser automation stack in order to provide the most stable environment for our customers to run their Selenium scripts in. Our goal is to deliver the best user experience for writing and maintaining a synthetic script and configuring the browser environments it runs in. The synthetic monitor data we produce is used for simulating website processes such as form-based authentication, eCommerce transactions, and regulatory checks.

Harnessing AIOps to Improve System Security

You’ve probably seen the term AIOps appear as the subject of an article or talk recently, and there’s a reason. AIOps is merging DevOps principles with Artificial Intelligence, Big Data, and Machine Learning. It provides visibility into performance and system data on a massive scale, automating IT operations through multi-layered platforms while delivering real-time analytics.

Improve Apache Spark performance with the S3 magic committer

Most Apache Spark users overlook the choice of an S3 committer (a protocol used by Spark when writing output results to S3), because it is quite complex and documentation about it is scarce. This choice has a major impact on performance whenever you write data to S3. On average, a large portion of Spark jobs are spent writing to S3, so choosing the right S3 committer is important for AWS Spark users.

Ocean Insights now available for Google Cloud

As companies move more applications into the cloud, and package them into containers, environments become more complex with limited visibility. While infrastructure is abstracted away as much of it is delivered by the hyperscalers, this creates an opaqueness that makes it hard to control costs and understand resource utilization. As a result, many companies are experiencing high cloud bills and lots of cloud waste.

The 10 Best AWS Migration Tools (Updated 2022)

Moving large amounts of data to the cloud can be arduous and time-consuming. A cloud migration would take years if engineers manually moved data from assessment through mobilization and migration phases. An effective cloud migration also requires adequate data encryption, fast data transfer speeds, and constant monitoring. Migrating workloads to AWS requires you to monitor costs in real-time as well to avoid overspending.

Cloverleaf and Customized Management Packs

Every business is different, and every IT environment has its own set of challenges. Customized SCOM Management Packs are created to meet the monitoring and automation requirements of your company's critical applications. OpsLogix offers a wide range of off-the-shelf monitoring products, but for companies working with niche applications, these are not always applicable.

Using automation and monitoring for documentation

I often have discussions with N-able partners who need help capturing data for either regular reporting or just to have when a customer asks for it. Often people will use a tool like BrightGauge to pull data from their RMM platform and generate dashboards and reports. However, these can be overly complex if you only need to capture the data so you have it and can act upon it. What I regularly recommend is that you create a monitoring script to capture just the data that is needed.

How to save on your Azure Monitor and Log Analytics Costs

Thomas Stringer has a couple of great blog posts on how to understand your Azure monitoring costs and also on how to reduce your costs, see Azure Monitor Log Analytics too Expensive? Part 2 – Save Some Money | Thomas Stringer (trstringer.com). In the past I’ve blogged on How to calculate the Azure Monitor and Log Analytics costs associated with AVD (not an easy task!).

The Business Case for Observability and Site Reliability Engineering

Unlike traditional IT Ops, the role of the SRE isn’t simply focused on finding and solving technical problems. The big win for today’s SREs is supporting the organization’s strategic innovation initiatives. With the appropriate observability capabilities, it’s possible to quantify the value that software infrastructure contributes to this innovation effort.

Get the most out of your .NET Builds

Give your .NET ecosystem the full power of DevOps running on AWS. The JFrog Platform covers the full application lifecycle of .NET builds from developer fingertips through distribution to consumers while covering application security, vulnerability analysis and artifact flow control. In this webinar, we will see how you can configure your .NET builds on AWS so that they take full advantage of the JFrog Platform for managing the lifecycle of your .NET artifacts.

The Top 5 Use Cases for AIOps Today

By now, you’ve likely heard of AIOps, a technique that promises to inject new levels of efficiency into IT operations with the help of AI and machine learning. But what, exactly, does AIOps mean in practice? Which specific use cases can IT organizations enable or improve with the help of AIOps? Those may be more difficult questions to answer if you have yet to see AIOps at work in your organization.

Kickstart your Splunk App with @Splunk/Create

I’ve been contributing to, and creating, Splunk apps for the better part of the last 10 years. But never have I felt more excited to be a Splunk Developer than right now. One of the primary reasons why I am so excited is because of build tools like @splunk/create. At Splunk, we recognize that developers are so crucial to our entire ecosystem.

LogStream for InfoSec: VPC Flow Logs - Reduce or Enrich? Why Not Both?

In the last few years, many organizations I worked with have significantly increased their cloud footprint. I’ve also seen a large percentage of newly launched companies go with cloud services almost exclusively, limiting their on-premises infrastructure to what cannot be done in the cloud — things like WiFi access points in offices or point of sale (POS) hardware for physical stores.

End-User Monitoring: How it Can Impact Your Business

After the global pandemic and lockdowns, most businesses are looking toward online solutions for their applications. They want to take their business online by creating mobile or web applications. Companies are looking to monitor every aspect of the application, including deployment, bugs, API failures, etc. But the most important thing is how the application behaves when it goes into the hands of the end-users.

AI-Powered Monitoring Could Have Saved Millions for Global Bank

As most people were preparing to celebrate the new year, the UK’s Santander Bank was dealing with a crisis. On Christmas Day, roughly 75,000 people who received payments from companies with accounts at Santander Bank received a duplicate payment transaction. The total damage amounted to £130m, and recovery in these situations is a painful process for both the bank and its customers.

Intro to Torq: Chatbots

The challenges and workloads facing today’s security teams are not getting easier, but the response methods of security teams are still manual, utilizing a patchwork of security tools that are neither connected nor communicating with each other. What if you could utilize your organization’s most common communication tool (e.g., Slack) to bring security communications and operations into every part of your organization?

Get Started with Tanzu Community Edition and VMware vSphere Using Kube-VIP

In this video, we go over VMware Tanzu Community Edition with VMware vSphere using a simple network configuration and managed clusters. You will learn step-by-step what is needed to quickly get started. This video utilizes Kube-VIP as the networking choice but will explain all of the available options.

Ask Miss O11y: Long-Running Requests

You need not fear a long-lived streaming workload. A few simple tricks can transform a request that may not ever terminate for hours or days into something you can get regular health and status updates on. We in fact have one of those continuous processing services—Beagle, our Service Level Objective stream processor—which we’ve instrumented in this fashion.

Is Your Zero-Trust Solution Missing Half Your Network?

With hybrid work environments on the rise, enterprise networks are dealing with multiple remote connections, increasing the risk of breaches and other attacks. One way to mitigate these risks is by implementing Zero Trust Architecture (ZTA) in the enterprise network. Unfortunately, however, this does not address the full set of threats to the enterprise, as many of the zero-trust service providers and on-premises network solutions do not address the voice network.

Monitoring AWS Spot instances using Sumo Logic

Spot worker nodes on EKS (Elastic Kubernetes Service) are a great way to save costs by allowing customers to take advantage of unused capacity. With Sumo Logic, we have experimented with and adopted spot worker nodes for some of our EKS clusters to see if we can pass along the same benefits. We decided to share some of the learnings, challenges, and caveats with using spot instances along with the monitoring setup.

How to use OpManager as an effective disk space monitor for your network monitoring environment

Disk space availability in servers is crucial. Applications that run on these servers save log files and write data to a database that is also installed on the server; if there isn’t enough disk space, the application may not work properly and may crash. Monitoring disk space is critical for IT administrators to maintain server performance and network availability by preventing a sudden and unexpected lack of server disk space.

What is MTTR? Resolve incidents faster through ops, alerting and documentation

When downtime strikes any distributed software deployment or platform, it’s all hands on deck until the lights are green and service is restored. This process, from the recognition of a problem to a deployed solution, has most commonly been defined as MTTR — mean time to resolution. In just the last few years, DevOps and site reliability (SRE) professionals have developed sophisticated new models for how they work and audit their successes.

Why you need network monitoring?

Network monitoring is a continuous analysis of a network to detect and correct any performance issues. Network monitoring involves collecting network statistics to determine the quality of services offered by the network. With tools like Icinga, it’s possible to monitor hardware and software in your network. Especially in the pandemic, when many employees work from home, it’s good to have a tool that checks the network continuously.

A single pane of glass for automatic incident response for Bridgeport Public School District

“I have been doing this for 20+ years and have been using literally every product out there. Derdack is unique at how issues are addressed and communicated out because of the seamless integration, maturity and flexibility of the platform. Working with Derdack has been a game changer for us and helped us to do more with less.” Jeff Postolowski, Director Information Technology Services, Bridgeport Public School District

Wisdom of the Crowds: The Value of User Sentiment Observability

What’s the first thing most people do when they’re unhappy with a business? Take to social media to complain about it. Observing those comments – otherwise known as “user sentiment observability” – gives you a heads-up as to when problems become big enough to impact user experience. How can you monitor that voice of the customer? And why is it important to do so? Let’s take a deeper look at the issues.

8 Best Sentry Alternatives You Should Try

Selecting the best Sentry alternative for error monitoring can be difficult. It is hard to sort through the features, benefits, and drawbacks of many software companies and vendors. Let's talk about how to make this process easier by looking at the eight best alternatives to Sentry. Sentry is an open-source application performance and error-tracking tool that allows developers to track and fix errors in real time.

A Complete Introduction to SQL Server Transactions

One of the most fundamental concepts in any relational database management system (RDBMS), such as SQL Server, is the transaction. During my consulting career, I've seen many performance problems caused by developers not understanding how transactions work in SQL Server. In this tutorial, I'll explain what transactions are, why they're necessary, and how they work in SQL Server.

How to become HIPAA compliant on AWS in 2022?

Since the 90s, if you run a company in the healthcare industry in the US market, you must comply with the Health Insurance Portability and Accountability Act (HIPAA) Security Rule. Some of the security rules are directly linked to how you operate your organization, others to how you manage your application data for your customers. This article will walk you through what to consider on AWS to be HIPAA compliant in 2022.

APM Insight: 2021 in Review

2021 was the year of hybrid work. An interesting year full of hope and thoughtfulness, 2021 saw increased office-based collaborations complementing our diverse remote workforce. Through this, we sought to look within to sort our processes and deliver what it takes to ensure the best monitoring experience for our customers worldwide. Here is a quick recap of the APM features we rolled out last year and a brief note on our plans for 2022.

Get Started with Playbooks Permissions

The goal of Mattermost Playbooks is to help teams consistently orchestrate any and all recurring workflows. A Playbook is a prescribed, repeatable process that a team has agreed on and formalized as a collaborative checklist saved on their Mattermost server. We at Mattermost use Playbooks for incident collaboration, customer onboarding, and product releases, along with many other complex processes.

No Internet? No Problem. Use Xray with an Air Gap - Part II

With software supply chain attacks on the rise, implementing DevSecOps best practices in an air gapped environment is a must. In an effort to secure an organization’s internal network, there is an increasing trend of separating the internal network from the external one, essentially creating an enclosed environment disconnected from the public internet. An air gapped solution provides stricter security requirements, but that’s not enough.

Rundeck Office Hours: Best Practices for Setting up Rundeck ACL's

Presented by Nathan Fleugel, December 9, 2021. Register to attend Rundeck’s monthly community office hours. Each 60-minute session will focus on a Rundeck topic followed by a live Q&A. Join us this month for an AMA discussion followed by a live Q&A led by technical experts from Rundeck’s engineering, product, and solution engineering teams. Experts are available to provide advice on your technical architecture, give recommendations for operational best practices, review current GitHub issues, or dive into the open source code itself.

Unpack Kubernetes Blindspots in 5-Minutes

With Kubernetes being used to scale the operation of containerized microservice architectures, massive amounts of metrics are being delivered on a continuous basis. That continuous stream of data can make it harder to ensure the performance of your IT footprint, especially if the proper tools are not in place to pinpoint missing data. Humio’s ability to gather deep insight from unstructured and structured data provides the visibility needed to highlight what is unseen.

Show it Off with Splunk TV! More Ways to Display Your Best Dashboards

Splunk TV lets you easily display your data on the big screen to visualize and monitor what’s going on in your business. Splunk TV is optimized for a hands-off experience, with slideshows and automatic scrolling so you can display the most important metrics securely and easily. We’re happy to announce that in addition to Classic (Simple XML) dashboards, we now support Studio Dashboards and IT Service Intelligence Glass Tables.

A beginner's guide to network monitoring with Grafana and Prometheus

Networks are the backbone of inter-communications within computer systems and applications. When networks go down or experience any interruption of service, the impact is widely felt and can result in significant service disruptions and lost revenue. This is why network monitoring is mission critical for organizations. Visibility into network performance is key to ensuring that network engineering teams can be more proactive and identify problems before those issues cause outages.

A Developer's Guide to Continuous Performance Testing

One of the most important phrases of DevOps practices is “Test early, test often.” It’s crucial to perform functional testing early with unit tests and integration tests. But it’s equally important to perform non-functional testing. That means you should have performance tests. As markets become more saturated with each passing day, you no longer have the luxury to postpone performance testing until all features are developed.

Webhook, Pub/Sub, and Slack Alerting notification channels launched

When an alert fires from your applications, your team needs to know as soon as possible to mitigate any user-facing issues. Customers with complex operating environments rely on incident management or related services to organize and coordinate their responses to issues. They need the flexibility to route alert notifications to platforms or services in the formats that they can accept.

Creating custom notifications with Cloud Monitoring and Cloud Run

The uniqueness of each organization in the enterprise IT space creates interesting challenges in how they need to handle alerts. With many commercial tools in the IT Service Management (ITSM) market, and lots of custom internal tools, we equip teams with tools that are both flexible and powerful. This post is for Google Cloud customers who want to deliver Cloud Monitoring alert notifications to third-party services that don’t have supported notification channels.

9 Types of Phishing and Ransomware Attacks-And How to Identify Them

Cyberattacks have become more pervasive globally, evolving quickly in sophistication and scale, and are now more lucrative than ever for cybercriminals. Not only has The Everywhere Workplace extended the cyber risk and threat landscape—especially for data privacy and its protection—but a lot of Agile software developers, many of whom lack any DevSecOps process, are publishing untested or poorly tested software that can be exploited as zero-days by criminal gangs.

Monitoring Endpoint Logs for Stronger Security

The massive shift to remote work makes managing endpoint security more critical and challenging. Yes, people were already using their own devices for work. However, the rise in phishing attacks during the COVID pandemic shows that all endpoint devices are at a higher risk than before. Plus, more companies are moving toward zero-trust security models. For a successful implementation, you need to secure your endpoints.

ICYMI: Honeycomb Developer Week: The Partner Ecosystem

We know that you value collaboration. That’s why we share incident reviews and learnings—because we believe the entire community benefits by working together transparently. In the spirit of working better together, we invited ecosystem partners from ApolloGraph, Cloudflare, LaunchDarkly, and PagerDuty to present at Honeycomb Developer Week, a three-day event filled with snackable, time-efficient learning sessions to help you uplevel your observability skills.

RPA vs. ITPA - What's the Difference?

The promise of robots integrating into our everyday lives has long been on the minds of forward-thinkers and visionaries. Perhaps none were quite as bold in their future predictions as American chemist and Nobel Laureate Glenn T. Seaborg, who envisioned a 21st century in which every home would not only have its own robot, but also an intelligent species of animals that could help with household chores.

How business optimization efforts stack up

Like many business leaders, you may be wondering if you’re getting the most out of your technology investments and if there are things you can do to gain efficiency. Many organizations have taken steps toward optimization to reap the rewards of: In fact, 35% of organizations have made significant or very significant progress toward optimizing risk management and cybersecurity, according to a global survey of 900 senior business leaders by ServiceNow and ThoughtLab.

Consulting Business Models (3 Potential Paths)

Consulting business models can make or break the success of firms. Before you decide on a business model for your consulting firm, consider the following questions: The right professional services business model will support scalability and profitability. If you're actively considering business consulting models and find yourself at a crossroads, this is the perfect article for you to read. A consulting firm's business model can determine the trajectory of the organization.

Introducing Flexible Subscriptions: Websites Are Dynamic, Monitoring Should Be Too

Have you ever felt limited or “locked into” a fixed SaaS subscription plan? Have you ever been forced into a Sales call only to struggle with the decision – and costs – of upgrading to a higher plan tier to add incremental features or usage you need? Are you subscribed to a SaaS plan today that’s chock-full of features or capabilities you’ve never used (or asked for!) – but are still paying for? If so, you’re not alone.

Modernizing Government Technology: How Federal Agencies Are Progressing on Technology Transformations

When the U.S. Congress passed the Modernizing Government Technology Act (MGT) of 2017 as part of the 2018 National Defense Authorization Act, it established both funding and a process intended to help bring aging federal IT systems and infrastructure up-to-date with state-of-the-art technologies common in the private sector. According to the legislation, the goals of MGT are to.

Continuously Securing Software Supply Chain

Catch this session to see a breakdown of the recent news related to software supply chain security and what you can do to meet new requirements and protect your software from such attacks. With new software supply chain attacks reaching the spotlight at an accelerating pace, security research uncovering novel attack methods and new mandates and guidelines starting to come into effect — it can be hard to stay on top of the latest developments and their implications.

GitHub Actions and Shipa Webinar

For a software engineer, GitHub is one of those tools that transcends personal and professional development activities. Bringing open source to the masses, GitHub is a familiar platform for many. A newer addition to the GitHub Platform is GitHub Actions, which was originally a workflow engine and is now expanding into CI/CD. Combining the ubiquity of GitHub Actions in your GitHub project/repository with the powerful application abstractions that Shipa provides makes for a great developer experience. Watch this webinar recording combining the power of the two platforms.

mooving to...Remix

Learn from Kent C. Dodds, Co-Founder, Director of Developer Experience at Remix, about moving to Remix, a new way to build websites. This 30-minute conversation will review the ups and downs of accessible, scalable, performant apps. Discover tips to achieve the ability to write code once and deploy anywhere. Hear firsthand how to build: a new full-stack web framework for modern UX; a fully fused frontend and backend of any website; simplified code without sacrificing usability.

Introducing wachy: A New Approach to Performance Debugging

Wachy is a new Linux performance debugging tool that Rubrik recently released as open source. It enables interesting new ways of understanding performance by tracing arbitrary compiled binaries and functions with no code changes. This blog post briefly outlines various performance debugging tools that we commonly use, and the advantages and disadvantages of each. Then, we discuss why and how we built wachy.

API performance testing with k6

Performance testing measures how well systems perform when subjected to various workloads. The key qualities being tested are stability and responsiveness. Performance testing shows the robustness and reliability of systems in general, along with the specific potential breaking points. In this tutorial, you will use k6 to do load testing on a simple API hosted on the Heroku platform. Then you will learn how to interpret the results obtained from the tests.

What Are the Best Practices for Inventory Auditing?

Audits are usually a long procedure that can take days or months to complete. Even then, there is no guarantee that the audit was fully effective, because some assets may have moved to other locations. Still, an audit is performed to get deep analytics of finance-related information, and it also provides other crucial information about assets and inventories. An audit helps in maintaining compliance with rules and regulations.

Why Heroku and AWS have failed to serve modern developers?

Heroku vs. AWS remains a long and persistent debate among developers. Both platforms have strengths and weaknesses. Over the last 10 years, Heroku and AWS have played a huge role in the cloud hosting and software development industry by unlocking productivity in a way that had never been achieved before. They are the platforms behind most of the successes of the last decade.

Are there good hackers?

Hello and welcome back to our “Mystery Jet Ski.” Much better than those programs about supernatural stuff and alien suppositions. Today we will continue with our exhaustive investigation on the hacker world, and we will delve a little more into the concept of “ethical hacker.” Is it true that there are good hackers? Who are the so-called “White hats”? Who will win this year’s Super Bowl?

Just How Important Is Your Integration Infrastructure?

Most companies take their integration infrastructure for granted. I’m talking about middleware such as IBM MQ, Kafka, Solace, ActiveMQ, RabbitMQ. These form the basis of most enterprise-level businesses. One of our electronic manufacturing customers was building products worth $40K per minute. A failure in one of the factory floor’s automated systems brought manufacturing operations to a complete halt.

Best Practices for IT to Support Hybrid Work in 2022

I hate to say this, but #Omicron is at the doorstep. According to the CDC website, there have been over 60M cases in the US so far. As a result, companies like Google and Apple are delaying returning to the office, while some now call the return date ‘history’. Although we cannot predict the nature of the virus, we have some best practices to help our customers and IT manage their employee experience in a hybrid distributed environment.

Getting the best out of Samsung Knox management with Mobile Device Manager Plus

In case you missed it, Samsung Knox has verified Mobile Device Manager Plus as a Knox Validated Partner solution. This means that our EMM solution meets its business-level requirements for 2022, and that we support a wide range of features to help you get the best out of all your mobile devices that support Samsung Knox capabilities.

Sponsored Post

What is Incident Response?

When a service is down, a system is failing, or a security issue is in the midst of occurring, organizations need a solid incident response process to get up and running again. Incident response isn't just for high severity, lights out incidents either; if you've rebooted your computer to fix a problem, you've been an incident responder yourself! Incidents happen, and any successful organization knows that instead of pretending that one day nothing will ever go wrong, it's far more useful to develop a comprehensive operational response plan. And to do so, you need to know what incident response is! Let's get into it.

Monitor Dell EMC Isilon with Crest Data Systems' integration in the Datadog Marketplace

Dell EMC Isilon is a petabyte-scale network attached storage (NAS) system that allows you to archive unstructured data. Isilon operates in a cluster to provide high availability, and you can scale up its throughput, IOPS, and storage space by adding nodes to your cluster. Isilon automatically replicates your data throughout the cluster to ensure durability and provides caching to minimize data retrieval latency.

Azure Active Directory (Azure AD) - 101

This is a multi-part series that covers monitoring Microsoft Azure Active Directory (AD). In this blog post, which is part 1 of the series, you will learn about and understand Microsoft Azure Active Directory (Azure AD) and how it is different from an on-premises Active Directory (AD). As technology keeps evolving, companies increasingly look to technologies like Cloud Computing to expand, modernize and stay competitive, and in doing so companies can expose themselves to risks.

Diving Under the Hood With Our New 'Node Status' Feature

More than anything else, Kubernetes troubleshooting relies on the ability to quickly contextualize the problem with what’s happening in the rest of the cluster. As complicated as this may sound, SPEED is really the name of the game. After all, more often than not, you will be conducting your investigation under the glow of fires burning bright in production. Getting relevant context quickly and seeing things holistically is exactly what Komodor was created for.

What is ServiceOps?

ServiceOps is a new business technology strategy that combines IT service management (ITSM) with IT operations management (ITOM). ServiceOps is fundamentally about connecting people, processes, and technology that are dependent on one another to enable successful service delivery and make user experiences better with automation, collaboration, and visibility across traditionally fragmented departments.

Improve Incident Response by Getting Control of Your (Unintelligent) Swarm

Incidents happen. Things go wrong. Systems fail. Sometimes they fail in unexpected and dramatic ways that create Major Incidents. PagerDuty makes a very specific distinction between an incident and an Incident. Your organization may also make such a distinction. Determining if an incident is major or not can come down to a number of factors, or a specific combination of factors, like the number of services affected, the customer impact, and the duration of the incident.

How to Build the Ultimate Database Monitoring Dashboard

Using the PerfStack feature in the Orion Platform, we will show you how to create the ultimate database dashboard that allows you to correlate performance across your entire IT stack. Quickly see if the problem truly is a database problem or resides somewhere else, such as the networking or system layers.

How to Import/Export Orion Alerts

The Out of the Box alerts on the Orion Platform are good, but alerts for specific needs are better. The flexibility of the alerting engine in the Orion Platform is one of the best ways to tune the platform to your needs. We'll show you how to import alerts from THWACK and how to share any of your excellent alerts with the SolarWinds community. Working with real-world examples is just another way to have your Orion Platform performing properly for your organization.

Have You Forgotten About Application-Level Security?

Security is one of the most changeable landscapes in technology at the moment. With innovations come new threats, and it seems like every week brings news of a major organization succumbing to a cyber attack. We’re seeing innovations like AI-driven threat detection and zero-trust networking continuing to be a huge area of investment. However, security should never be treated as a single plane.

Telegraf Best Practices: Config Recommendations and Performance Monitoring

Telegraf has reached the ripe old age of V1.21.2. Thanks to community feedback and contribution, there have been many features added over the years. Lately, I have seen these questions pop up. If any of these questions plague your mind, have no fear — this blog is here to help! Here are my golden rules for maintaining best practices when building your Telegraf solution.

Report: Flawed Hiring Technology is Exacerbating 'the Great Resignation'

As “the Great Resignation” continues, employee turnover remains a key challenge and source of anxiety for businesses around the world. In the U.S. alone, last November saw a record 4.53 million workers quit their jobs. But oddly, hiring teams are scrambling to fill open positions. In theory, a mass exodus of employees leaving their jobs would mean there’s an overflow of qualified candidates in the hiring market. But new research suggests that isn’t the case.

How to create an Azure storage account

So you are finally ready to try out some of Azure’s most common and loved technologies. Well, the cheap storage they provide is definitely one of them. Whether you want to store your own personal files and photos, or you have a requirement from your business to quickly expand some storage, with Azure it’s quick and easy to set up, access and use. In this quick tutorial I will run you through just how easy it is to set up an Azure Storage Account.

Helping customers answer the trillion-dollar digital transformation question

As a former chief experience officer, I’ve seen just how impactful digital transformation can be. When I worked at Under Armour, we focused on our customers’ connected fitness journey, which unlocked new revenue streams and engagement opportunities. Yet, I’ve been around long enough to see my fair share of disappointments and underperforming projects. Any C-level executive will say the same.

Why COGS Isn't The Most Relevant Cost Metric For SaaS Companies

For most SaaS companies, COGS (which stands for cost of goods sold) is used to calculate gross margin and profit. COGS is an accepted term with a specific definition under U.S. Generally Accepted Accounting Principles (GAAP) — and is widely used as part of calculations to gauge the health and valuation of a company. Like many accounting practices, COGS stems from the industrial era, when most businesses were concerned with the creation of physical goods.

The Biggest Developer Platform in the World - GitHub Actions and Shipa 101

For a software engineer, GitHub is one of those tools that transcends personal and professional development activities. Bringing open source to the masses, GitHub is a familiar platform for many. A newer addition to the GitHub Platform is GitHub Actions, which was originally a workflow engine and is now expanding into CI/CD.