Building Technical Operations

Building and running a large production system is difficult. This blog is for people who build and manage large Internet and intranet systems and services

Saturday, March 29, 2014

7 Tips For Successful PCI DSS Audit

›
In the light of my recent experience leading Instart Logic through the process of PCI DSS Level-1 certification , and some experience im...
2 comments:
Saturday, November 30, 2013

Vendor Management Tips

›
Overview This post is for technical operations folks managing vendor contracts. For technical people dealing with vendors and legal ...
1 comment:
Sunday, June 16, 2013

What to monitor on a Linux box

›
This article is a kind of reminder for me (and for anyone else managing a monitoring system) about which metrics should/can be monitored fo...
4 comments:

Metrics, metrics, metrics...

›
A lot has been said about the importance of system and application metrics - I'll not repeat this, and will concentrate on of-the-shelf...
Thursday, June 28, 2012

The art of crontab jobs monitoring, part II

›
In part I of the discussion I've outlined some basic principals of crontab jobs monitoring. One of the biggest disadvantage of the des...
2 comments:
Saturday, May 26, 2012

The art of application logging

›
There are three frequently met problems with logging functionality in in-house software: Application developers don't really care a...
6 comments:
Saturday, April 21, 2012

The art of crontab jobs monitoring

›
In a regular production or development environment there are normally a lot of crontab jobs configured on running servers. The jobs can be a...
1 comment:
Saturday, November 5, 2011

Don’t hesitate to ask your R&D for software documentation!

›
If you, as an Operations Engineer, have experience working with third-party software, it is most likely that you enjoy a full set of docume...
5 comments:
Thursday, October 13, 2011

Remote Control for Your Production Site

›
Have you ever found yourself rushing to the office data center in the middle of the night just because a critical server is down, and you do...
Saturday, October 8, 2011

Have a problem managing your tasks? Read this post!

›
If you are lucky enough to handle all your tasks and requests in time, without a need to manage and prioritize a backlog, then you probably...
8 comments:
Sunday, October 2, 2011

My monitoring tools set

›
Now it is time to talk about my set of monitoring tools suitable to monitor medium and large systems. Nagios monitoring service I just l...
8 comments:
Friday, September 30, 2011

Handling of external monitoring alerts

›
If you have an Internet-facing production system it is always wise to use an external web availability and/or performance monitoring service...
2 comments:
Sunday, September 25, 2011

Why you need an Operations Change Log

›
This post will explain why you need to manage an Operation Change Log, and will describe a simple and effective method on how to accomplish ...
Friday, September 23, 2011

Email and SMS alerting policy

›
In a regular production environment there are normally three types of monitoring events and corresponding notification methods: Critical al...
6 comments:
Thursday, September 22, 2011

Assigning DNS names to IP addresses

›
Why is it so important to configure proper DNS names for all used IP addresses? A few reasons: To make people remember the names and not ...
10 comments:
Monday, September 12, 2011

My list of favorite Nagios check scripts

›
Nagios is a great monitoring tool - I used it to monitor networks with hundreds of hosts and thousands of service checks. One of the biggest...
Saturday, September 3, 2011

Equipment naming convention

›
Why it is so important? Having a clear and meaningful equipment naming convention will help you to: minimize human errors of executing ...
2 comments:
Monday, August 29, 2011

How to start?

›
Some people ask me - how to start building a production system, and make sure that it will be reliable, scaleable and manageable when it wil...

Operations requirements for in-house R&D products

›
This post will help you to define specific requirements from Operations to R&D for all in-house software provided for production depl...
2 comments:
Thursday, June 23, 2011

Knowledge and information management

›
Knowledge and information management is one of the building blocks in an effective Operations department. It is really important to get a...
13 comments:
›
Home
View web version

About Me

My photo
Victor Gartvich
SFBA, United States
I'm a professional in the technical operations field with an extensive working history as an operations manager, network and system administrator
View my complete profile
Powered by Blogger.