Universal Storage Collector

First, I have created a VNX Collector. Then I have created a VPLEX Collector. This systems are very similar. My next task was periodically extracting performance data from NAR files. At that moment I decided to create an universal tool to collect performance data from different storage systems with output to different collectors. Later I decide to add gathering performance data from EMC VMAX.

And now after some month of work I present Universal Storage Collector — a modular and flexible tool.

What is it

Modular tool. This mean that it is easy to add an extractor for new types of storage systems, or add output to new data collector. This is modularity by source code. It does not use binary plugin at this time. But it is easy to add new code.

Flexible tool. This mean that we could work with different storage systems. Each storage system has its own extractor, output to its own collector, and has its own interval for gathering.

Extractors. At this time this tool could extract performance data from:

EMC CLARiiON/VNX block using naviseccli
EMC Celerra/ VNX file
EMC CLARiiON/VNX block using NAR files
EMC VPLEX
EMC VMAX from Unisphere reports

Output. At this time this tool could output to Carbon (Graphite) and InfluxDB.

How it works

This tool has one configuration file. At this file we specify:

List of extractors with its common parameters.
List of outputs with its parameters.
List of storage systems we would work with.

For each storage system we should specify:

name
class — name of storage system type (VNX, VMAX, VPLEX and so on)
type — type (block or file) for the unified storage systems (optional)
interval — time interval between polling
extractor — one from the list with concrete parameters for this storage system
output — one from the list

Combinations of name, class and type are unique. We could have (array1, vnx, block) and (array1, vnx, file), but we could not have two (array1, vnx, block) with different extractors. Only one will survive. The first one from the list.

After start, this tool:

check each extractor definition
check each output definition
check each storage system definition and create concrete extractor and output exemplar for each of them
create the list of actors, each actor is a concrete storage system
‘ask’ each actor in its interval

When each actor receive ‘ask’ signal, its begin transmission to output, send ‘ask’ message to its extractor, and then stop transmission.

When each concrete extractor receive ‘ask’ signal it extract data from concrete storage system and send its directly to concrete output.

Setup

To start this tool, we should:

create a home directory for it
create a conf subdirectory and put configuration file collector.xml into it
create a log subdirectory for error log file
create a pool subdirectory, if you plan to use extractor from VNX NAR files
specify USC_HOME environment variable with home directory

The source code of this tool is available here https://github.com/vzaigrin/UniversalStorageCollector

Один ответ на “Universal Storage Collector”

Уведомление: VPLEXCollector – tool for gathering EMC VPLEX | Vadim Zaigrin

Уведомление: VNXCollector – DIY EMC VNX Monitoring and Reporting | Vadim Zaigrin

Уведомление: Extracting data from NAR files | Vadim Zaigrin

Уведомление: Extracting performance data from EMC VMAX | Vadim Zaigrin

Уведомление: Extracting performance data from EMC RecoverPoint | Vadim Zaigrin

Hi Vadim

Did you test USC with vplex 5.5? I have an exception «unparseable date», can you help?

Thanks
Chris

Ответить

Vadim Zaigrin:

20 июля, 2017 в 12:04

Hi

I didn’t test VPLEX 5.5, but I don’t think that there is a big difference with it.

What do you mean «unparseable date»?
USC reads first and last lines from monitors files in csv format.

Ответить
- Chris:
  
  20 июля, 2017 в 12:48
  
  Thanks for your reply
  In fact I just found a thread in your github about it, it seems monitors ans sinks need to be created first….
  I ll give a try to your awesome work and keep you informed
  
  Best regards

Hi Vadim,

It’s working, thanks a lot for your work!

Chris

Ответить

Vadim Zaigrin:

21 июля, 2017 в 11:54

Welcome 🙂

Ответить

Hi Vadim,

Just a quick question : is there a way to have USC logging? I can’t see anything in /opt/USC/log
Thanks

Ответить

Vadim Zaigrin:

2 августа, 2017 в 22:07

USC logs only known errors.
USC uses fr.janalyse.ssh.SSH for connections to VPLEX, but it doesn’t return any informations about connections.

Ответить
- Chris:
  
  3 августа, 2017 в 16:43
  
  Hi
  
  In fact I m playing with the vnx nar extractor and I needed to see if it extracts the storage pools stats from nar because I can’t see them in graphite.
  Maybe the object must be added in collector.xml ?
Vadim Zaigrin:

4 августа, 2017 в 09:55

VNXNar extractor is less configurable extractor. Because it is hard to make it flexible.
But there are not a lot of data about pools in nar files. Only FAST Cache and MLU measurements.

When USC extract data about LUNs and Disks it says what pool its belong to.
In the Grafana you could sum over all LUNs from one pool — this will be information about pool.

To check what USC outputs to the Carbon you could use ‘nc’ utility: nc -k -l 2003
Here is an example of such output: https://vzaigrin.files.wordpress.com/2017/01/vnxnar.png

Ответить
- Chris:
  
  4 августа, 2017 в 12:47
  
  Hi Vadim
  
  Thanks for your reply, VNX performance stats are indeed pretty poor, I’ll give up till next array
  
  Have a nice day

Hi Vadim,

i’m playing with the vplex extractor and it seems to me that everytime you pull a csv-file you open a new ssh conncetion. would’n it be possible to pull all csv files over a single ssh connection? should even be faster, doesn’t it?

Ответить

Vadim Zaigrin:

22 сентября, 2017 в 21:48

Hello.

Sorry for delayed answer.

Yes, for each monitor (csv file) VPLEX Extractor executes SSH.once with ‘tail’ command.
It is possible to start a SSH session before ‘monitors foreach’ cycle, execute command inside cycle, and close session after cycle.
But I don’t have an access to VPLEX hardware anymore.
So I cann’t try this.

Ответить

Hi, impressive job!
Does the vnx collector also work with Unity?

Ответить

Tim:

14 февраля, 2018 в 13:11

nvm, i just saw your answer on github issue 🙂

Ответить

Hello Vadim.

Im having problems when compiling your tool, seems that the janalyse-ssh repo is not working, i tried with Maven and a newer version but im getting errors.

Maybe its just me doing it wrong(1st time compiling scala 😉 ). Im trying to compile with : sudo sbt compile build.sbt

Hope you can help me, thanks for your work!
Regards

Ответить

Vadim Zaigrin:

11 июля, 2018 в 14:06

Hello.
You don’t need to compile it if you didn’t change the source code.
Compiled version is available here: https://github.com/vzaigrin/UniversalStorageCollector/tree/master/USC

Ответить
- Daniel:
  
  12 июля, 2018 в 01:39
  
  I ended up compiling it haha ^^
  
  Changed the janalyse-ssh to a newer version and this repo: http://central.maven.org/maven2/ and worked.
  
  Thanks for the tool! Really usefull!
- Vadim Zaigrin:
  
  12 июля, 2018 в 11:42
  
  Great! 🙂

Hi Vadim! I took a look at the configuration file of section «vnxblock» and saw that you get SP utilization from value returned by SP. I think this not very correct, because this value is show SP utulization from last statistics clear or SP reboot… I find an article https://thesanguy.com/2012/10/24/automating-storage-processor-utilization-alerts-with-emc-performance-manager/ where more correct SP utulization value calculated. What algorithm you use to calculate SP utilization?

Ответить

Vadim Zaigrin:

3 октября, 2018 в 13:51

Yes, I know that naviseccli returns strange results. I show it here: https://vzaigrin.wordpress.com/2017/01/29/extracting-data-from-nar-files
That’s why I preffer to work with nar files.

Ответить
- Vadim Kuchin:
  
  3 октября, 2018 в 16:09
  
  Thank you for reply. Do you have plans to correct your SP utilization algorithm to use mentioned formulae and data from naviseccli? Because we interested in online monitoring, not to collect NAR files…
- Vadim Zaigrin:
  
  3 октября, 2018 в 19:09
  
  Do you mean «Utilization = Busy Ticks / (Busy Ticks + Idle Ticks)»?
  No. My tool provides output from storage systems «as is». We could compute over its later. For example, in Grafana, Graphite or in Influxdb.

Can i get some detailed step by step document to install and configure this with Graphite and Grafana to collect data from VNX (File and Block), VMAX?

Ответить

Vadim Zaigrin:

6 января, 2019 в 14:44

Hi! This is simple.
1) Download contents of https://github.com/vzaigrin/UniversalStorageCollector/tree/master/USC into folder /opt/USC
2) Change file /opt/USC/conf/collector.xml to match your systems. Storage systems are in section «systems». Graphite and InfluxDB are in section «outputs».
3) If you use systemd:
— copy file /opt/USC/bin/collector.service into folder /etc/systemd/system
— automatically get it to start on boot: systemctl enable collector
— start the service: systemctl start collector

Ответить

Hey Vadim — I wanted to say — nice work! Worked slick for a test environment of mine. Thanks brother!

Ответить

Vadim Zaigrin:

31 октября, 2019 в 20:44

Thanks for the feedback. I’m glad my work is useful.

Ответить

Уведомление: VPLEXCollector – tool for gathering EMC VPLEX | Vadim Zaigrin
Уведомление: VNXCollector – DIY EMC VNX Monitoring and Reporting | Vadim Zaigrin
Уведомление: Extracting data from NAR files | Vadim Zaigrin
Уведомление: Extracting performance data from EMC VMAX | Vadim Zaigrin
Уведомление: Extracting performance data from EMC RecoverPoint | Vadim Zaigrin
Chris:

19 июля, 2017 в 12:59

Hi Vadim

Did you test USC with vplex 5.5? I have an exception «unparseable date», can you help?

Thanks
Chris

Ответить
- Vadim Zaigrin:
  
  20 июля, 2017 в 12:04
  
  Hi
  
  I didn’t test VPLEX 5.5, but I don’t think that there is a big difference with it.
  
  What do you mean «unparseable date»?
  USC reads first and last lines from monitors files in csv format.
  
  Ответить
  - Chris:
    
    20 июля, 2017 в 12:48
    
    Thanks for your reply
    In fact I just found a thread in your github about it, it seems monitors ans sinks need to be created first….
    I ll give a try to your awesome work and keep you informed
    
    Best regards
Chris:

21 июля, 2017 в 11:03

Hi Vadim,

It’s working, thanks a lot for your work!

Chris

Ответить
- Vadim Zaigrin:
  
  21 июля, 2017 в 11:54
  
  Welcome 🙂
  
  Ответить
Chris:

1 августа, 2017 в 12:34

Hi Vadim,

Just a quick question : is there a way to have USC logging? I can’t see anything in /opt/USC/log
Thanks

Ответить
- Vadim Zaigrin:
  
  2 августа, 2017 в 22:07
  
  USC logs only known errors.
  USC uses fr.janalyse.ssh.SSH for connections to VPLEX, but it doesn’t return any informations about connections.
  
  Ответить
  - Chris:
    
    3 августа, 2017 в 16:43
    
    Hi
    
    In fact I m playing with the vnx nar extractor and I needed to see if it extracts the storage pools stats from nar because I can’t see them in graphite.
    Maybe the object must be added in collector.xml ?
- Vadim Zaigrin:
  
  4 августа, 2017 в 09:55
  
  VNXNar extractor is less configurable extractor. Because it is hard to make it flexible.
  But there are not a lot of data about pools in nar files. Only FAST Cache and MLU measurements.
  
  When USC extract data about LUNs and Disks it says what pool its belong to.
  In the Grafana you could sum over all LUNs from one pool — this will be information about pool.
  
  To check what USC outputs to the Carbon you could use ‘nc’ utility: nc -k -l 2003
  Here is an example of such output: https://vzaigrin.files.wordpress.com/2017/01/vnxnar.png
  
  Ответить
  - Chris:
    
    4 августа, 2017 в 12:47
    
    Hi Vadim
    
    Thanks for your reply, VNX performance stats are indeed pretty poor, I’ll give up till next array
    
    Have a nice day
Stefan:

25 августа, 2017 в 18:20

Hi Vadim,

i’m playing with the vplex extractor and it seems to me that everytime you pull a csv-file you open a new ssh conncetion. would’n it be possible to pull all csv files over a single ssh connection? should even be faster, doesn’t it?

Ответить
- Vadim Zaigrin:
  
  22 сентября, 2017 в 21:48
  
  Hello.
  
  Sorry for delayed answer.
  
  Yes, for each monitor (csv file) VPLEX Extractor executes SSH.once with ‘tail’ command.
  It is possible to start a SSH session before ‘monitors foreach’ cycle, execute command inside cycle, and close session after cycle.
  But I don’t have an access to VPLEX hardware anymore.
  So I cann’t try this.
  
  Ответить
Tim:

14 февраля, 2018 в 13:09

Hi, impressive job!
Does the vnx collector also work with Unity?

Ответить
- Tim:
  
  14 февраля, 2018 в 13:11
  
  nvm, i just saw your answer on github issue 🙂
  
  Ответить
Daniel:

11 июля, 2018 в 12:49

Hello Vadim.

Im having problems when compiling your tool, seems that the janalyse-ssh repo is not working, i tried with Maven and a newer version but im getting errors.

Maybe its just me doing it wrong(1st time compiling scala 😉 ). Im trying to compile with : sudo sbt compile build.sbt

Hope you can help me, thanks for your work!
Regards

Ответить
- Vadim Zaigrin:
  
  11 июля, 2018 в 14:06
  
  Hello.
  You don’t need to compile it if you didn’t change the source code.
  Compiled version is available here: https://github.com/vzaigrin/UniversalStorageCollector/tree/master/USC
  
  Ответить
  - Daniel:
    
    12 июля, 2018 в 01:39
    
    I ended up compiling it haha ^^
    
    Changed the janalyse-ssh to a newer version and this repo: http://central.maven.org/maven2/ and worked.
    
    Thanks for the tool! Really usefull!
  - Vadim Zaigrin:
    
    12 июля, 2018 в 11:42
    
    Great! 🙂
Vadim Kuchin:

3 октября, 2018 в 13:19

Hi Vadim! I took a look at the configuration file of section «vnxblock» and saw that you get SP utilization from value returned by SP. I think this not very correct, because this value is show SP utulization from last statistics clear or SP reboot… I find an article https://thesanguy.com/2012/10/24/automating-storage-processor-utilization-alerts-with-emc-performance-manager/ where more correct SP utulization value calculated. What algorithm you use to calculate SP utilization?

Ответить
- Vadim Zaigrin:
  
  3 октября, 2018 в 13:51
  
  Yes, I know that naviseccli returns strange results. I show it here: https://vzaigrin.wordpress.com/2017/01/29/extracting-data-from-nar-files
  That’s why I preffer to work with nar files.
  
  Ответить
  - Vadim Kuchin:
    
    3 октября, 2018 в 16:09
    
    Thank you for reply. Do you have plans to correct your SP utilization algorithm to use mentioned formulae and data from naviseccli? Because we interested in online monitoring, not to collect NAR files…
  - Vadim Zaigrin:
    
    3 октября, 2018 в 19:09
    
    Do you mean «Utilization = Busy Ticks / (Busy Ticks + Idle Ticks)»?
    No. My tool provides output from storage systems «as is». We could compute over its later. For example, in Grafana, Graphite or in Influxdb.
Karthikeyan Sundaram:

14 декабря, 2018 в 12:24

Can i get some detailed step by step document to install and configure this with Graphite and Grafana to collect data from VNX (File and Block), VMAX?

Ответить
- Vadim Zaigrin:
  
  6 января, 2019 в 14:44
  
  Hi! This is simple.
  1) Download contents of https://github.com/vzaigrin/UniversalStorageCollector/tree/master/USC into folder /opt/USC
  2) Change file /opt/USC/conf/collector.xml to match your systems. Storage systems are in section «systems». Graphite and InfluxDB are in section «outputs».
  3) If you use systemd:
  — copy file /opt/USC/bin/collector.service into folder /etc/systemd/system
  — automatically get it to start on boot: systemctl enable collector
  — start the service: systemctl start collector
  
  Ответить
Mark Brown:

31 октября, 2019 в 20:29

Hey Vadim — I wanted to say — nice work! Worked slick for a test environment of mine. Thanks brother!

Ответить
- Vadim Zaigrin:
  
  31 октября, 2019 в 20:44
  
  Thanks for the feedback. I’m glad my work is useful.
  
  Ответить

Вадим Заигрин

Universal Storage Collector

What is it

How it works

Setup

Один ответ на “Universal Storage Collector”

Ответить на Vadim Zaigrin Отменить ответ

What is it

How it works

Setup

Поделиться ссылкой:

Похожее

Один ответ на “Universal Storage Collector”

Ответить на Vadim Zaigrin Отменить ответ