avensolutions/datadog_yarn_metrics

DataDog Collect YARN Metrics

Collects application metrics from YARN and publishes these to DataDog.

Example DataDog YARN Metrics Dashboard

[Image: datadog-screenshot]

Metrics collected

Datadog metrics:
	yarn.apps.queued								(Desc: COUNT of ALL queued applications, Tags:	None)
	yarn.apps.failed								(Desc: COUNT of ALL failed applications (evaluated hourly), Tags:	None)
	yarn.apps.failed.byQueue						(Desc: COUNT of failed applications by Queue (evaluated hourly), Tags:	queue:{default, production, etc})
	yarn.apps.failed.byAppType						(Desc: COUNT of failed applications by AppType (evaluated hourly), Tags: apptype:{MR, TEZ, SPARK, etc})
	yarn.apps.failed.byUser							(Desc: COUNT of failed applications by User (evaluated hourly), Tags: user:{u12345, etc})
	yarn.apps.succeeded								(Desc: COUNT of ALL succeeded applications (evaluated hourly), Tags:	None)
	yarn.apps.succeeded.byQueue						(Desc: COUNT of succeeded applications by Queue (evaluated hourly), Tags:	queue:{default, production, etc})
	yarn.apps.succeeded.byAppType					(Desc: COUNT of succeeded applications by AppType (evaluated hourly), Tags: apptype:{MR, TEZ, SPARK, etc})
	yarn.apps.succeeded.byUser						(Desc: COUNT of succeeded applications by User (evaluated hourly), Tags: user:{u12345, etc})
	yarn.apps.killed								(Desc: COUNT of ALL killed applications (evaluated hourly), Tags:	None)
	yarn.apps.killed.byQueue						(Desc: COUNT of killed applications by Queue (evaluated hourly), Tags:	queue:{default, production, etc})
	yarn.apps.killed.byAppType						(Desc: COUNT of killed applications by AppType (evaluated hourly), Tags: apptype:{MR, TEZ, SPARK, etc})
	yarn.apps.killed.byUser							(Desc: COUNT of killed applications by User (evaluated hourly), Tags: user:{u12345, etc})
	yarn.apps.running								(Desc: COUNT of ALL running applications, Tags:	None)		
	yarn.apps.running.submittype					(Desc: COUNT by SubmitType, Tags: submittype:BATCH|INTERACTIVE)
	yarn.apps.running.allocatedGB					(Desc: SUM allocatedGB, Tags: None)		
	yarn.apps.running.allocatedVCores				(Desc: SUM allocatedVCores, Tags: None)		
	yarn.apps.running.runningContainers				(Desc: SUM runningContainers, Tags:	None)		
	yarn.apps.running.maxElapsedTime				(Desc: MAX ElapsedTime, Tags: None)		
	yarn.apps.running.maxAllocatedGB				(Desc: MAX allocatedGB, Tags: None)		
	yarn.apps.running.maxAllocatedVCores			(Desc: MAX allocatedVCores, Tags: None)		
	yarn.apps.running.maxRunningContainers			(Desc: MAX RunningContainers, Tags:	None)		
	yarn.apps.running.maxMemorySeconds				(Desc: MAX MemorySeconds, Tags:	None)		
	yarn.apps.running.maxVCoreSeconds				(Desc: MAX VCoreSeconds, Tags: None)		
	yarn.apps.running.apptype						(Desc: COUNT by AppType, Tags: apptype:{MR, TEZ, SPARK, etc})
	yarn.apps.running.queue							(Desc: COUNT by Queue, Tags: queue:{default, production, etc})
	yarn.apps.running.allocatedGB.byQueue			(Desc: SUM by Queue, Tags: queue:{default, production, etc})
	yarn.apps.running.allocatedGB.byAppType			(Desc: SUM by AppType, Tags: apptype:{MR, TEZ, SPARK, etc})
	yarn.apps.running.allocatedVCores.byQueue		(Desc: SUM by Queue, Tags: queue:{default, production, etc})
	yarn.apps.running.allocatedVCores.byAppType		(Desc: SUM by AppType, Tags: apptype:{MR, TEZ, SPARK, etc})
	yarn.apps.running.runningContainers.byQueue		(Desc: COUNT by Queue, Tags: queue:{default, production, etc})
	yarn.apps.running.runningContainers.byAppType	(Desc: COUNT by AppType, Tags: apptype:{MR, TEZ, SPARK, etc})
	yarn.apps.running.totalMemorySeconds.byQueue	(Desc: SUM by Queue, Tags: queue:{default, production, etc})
	yarn.apps.running.totalMemorySeconds.byAppType	(Desc: SUM by AppType, Tags: apptype:{MR, TEZ, SPARK, etc})
	yarn.apps.running.totalVCoreSeconds.byQueue		(Desc: SUM by Queue, Tags: queue:{default, production, etc})
	yarn.apps.running.totalVCoreSeconds.byAppType	(Desc: SUM by AppType, Tags: apptype:{MR, TEZ, SPARK, etc})
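
As a rough illustration of how a few of the running-application metrics above could be derived, here is a minimal sketch. It assumes the application records come from the YARN ResourceManager REST API (`/ws/v1/cluster/apps`), whose app objects carry fields such as `state`, `queue`, `allocatedMB`, `allocatedVCores` and `elapsedTime`. The function name `aggregate_running` and the `(name, value, tags)` tuple shape are hypothetical and not taken from this repository; the actual collector may compute these values differently.

```python
from collections import defaultdict


def aggregate_running(apps):
    """Turn a list of YARN app dicts into (metric, value, tags) tuples.

    Hypothetical helper: field names follow the YARN ResourceManager
    REST API; the output shape is an assumption made for this sketch.
    """
    # Only applications currently in the RUNNING state are counted.
    running = [a for a in apps if a.get("state") == "RUNNING"]

    metrics = [
        # COUNT of all running applications
        ("yarn.apps.running", len(running), []),
        # SUM of allocated memory, converted from MB to GB
        ("yarn.apps.running.allocatedGB",
         sum(a.get("allocatedMB", 0) for a in running) / 1024.0, []),
        # SUM of allocated vcores
        ("yarn.apps.running.allocatedVCores",
         sum(a.get("allocatedVCores", 0) for a in running), []),
        # MAX elapsed time, left in the unit YARN reports (milliseconds);
        # falls back to 0 when nothing is running
        ("yarn.apps.running.maxElapsedTime",
         max([a.get("elapsedTime", 0) for a in running] or [0]), []),
    ]

    # COUNT of running applications per queue, tagged queue:<name>
    by_queue = defaultdict(int)
    for a in running:
        by_queue[a.get("queue", "unknown")] += 1
    for queue, count in sorted(by_queue.items()):
        metrics.append(("yarn.apps.running.queue", count, ["queue:" + queue]))

    return metrics
```

The per-AppType and per-User variants would follow the same grouping pattern, keyed on `applicationType` or `user` instead of `queue`.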

Dependencies

  • DataDog Agent
  • Python 2.7+
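
The DataDog Agent dependency matters because the Agent ships a DogStatsD listener (UDP port 8125 by default) that accepts gauge datagrams of the form `name:value|g|#tag1,tag2`. The sketch below shows one way tagged gauges could be pushed to it using only the standard library. It is an assumption about the transport, not a description of this project's code, which may instead use the official `datadog` Python library or an Agent check. The helper names `format_gauge` and `send_gauges` are hypothetical.

```python
import socket


def format_gauge(name, value, tags=None):
    """Build a DogStatsD gauge datagram: name:value|g|#tag1,tag2"""
    datagram = "{0}:{1}|g".format(name, value)
    if tags:
        datagram += "|#" + ",".join(tags)
    return datagram


def send_gauges(metrics, host="127.0.0.1", port=8125):
    """Send (name, value, tags) tuples to a DogStatsD listener over UDP.

    host and port are assumptions matching the Agent's default
    DogStatsD settings; adjust them to the local Agent configuration.
    """
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        for name, value, tags in metrics:
            payload = format_gauge(name, value, tags).encode("utf-8")
            sock.sendto(payload, (host, port))
    finally:
        sock.close()
```

For example, `send_gauges([("yarn.apps.running.queue", 3, ["queue:default"])])` would emit one gauge tagged with the queue name. UDP is fire-and-forget, so a missing Agent does not raise an error here.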
