-
Notifications
You must be signed in to change notification settings - Fork 304
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HPCC-33000 Add a Troubleshooting chapter to Containerized manual #19303
HPCC-33000 Add a Troubleshooting chapter to Containerized manual #19303
Conversation
Jira Issue: https://hpccsystems.atlassian.net//browse/HPCC-33000 Jirabot Action Result: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
few minor items - inline.
pod status, you can quickly identify and address potential issues, | ||
ensuring the health and performance of your applications. </para> | ||
|
||
<para>The HPCC platform has a pod for each component of a system.</para> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
HPCC Systems platform
|
||
<para>This lists all pods in all namespaces.</para> | ||
|
||
<para>Each pod should indicate a Status of <emphasis |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Status s/b lower case
<para><programlisting>kubectl get services</programlisting></para> | ||
|
||
<para>This lists all services in your cluster, along with their type, | ||
internal and external IP addresses, Port, and Uptime (Age).</para> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
capitalization ?? should Port and Uptime be init cap?
errors occurring within your application containers. By analyzing these | ||
logs, you can quickly identify and address a wide range of issues.</para> | ||
|
||
<para>To view the logs of a specific pod,use the following command in a |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
add space to pod,use
|
||
<para><programlisting>kubectl logs <pod-name> --since=10m</programlisting></para> | ||
|
||
<para>filters logs from a specific time.</para> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Init cap filters?
|
||
<variablelist> | ||
<varlistentry> | ||
<term>Check your Helm chart configuration</term> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
add period to sentence end.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I added periods to all of these.
fa63834
to
43c88d5
Compare
43c88d5
to
33b465f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good - approved.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@JamesDeFabia - looks good in general, a few minor comments.
pod status, you can quickly identify and address potential issues, | ||
ensuring the health and performance of your applications. </para> | ||
|
||
<para>The HPCC Systems platform has a pod for each component of a system.</para> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
.. has 1 or more pod corresponding to each component of a deployed system. ?
(e.g. Thor has 2 static agents, and other dynamically launched pods for the manager + workers)
|
||
<variablelist> | ||
<varlistentry> | ||
<term>Container Failures </term> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
trivial: extra space after Failures
|
||
<para><programlisting>kubectl logs <pod-name></programlisting></para> | ||
|
||
<para>This returns the entire log for a pod.</para> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
how much is returned will depend on the log rotation policy, which I think will depend on the container runtime retention policy.
Just saying, if the pod's logs is GB's long - kubectl logs is unlikely to return it all..
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Might also be worth noting something about pods with multiple containers. Dali by default has 2 (dali itself and sasha-coalescer). By default kubectl logs will return the logs of the 1st container in the pod.
If you need the logs for a specific container you need to say:
kubectl logs -c
<term>Debug application behavior</term> | ||
|
||
<listitem> | ||
<para>Examine container logs and events to understand the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is under "By carefully analyzing this information, you can:" , suggesting you can see logs via kubectl describe pod.. - a tad misleading.. since kubectl describe will not show you any logging insights.
You need to run kubectl logs for that...
Maybe delete this listitem, since you have a later section covering logging anyway.
Signed-off-by: Jim DeFabia <[email protected]>
33b465f
to
35f690b
Compare
@jakesmith Made the edits you requested, please have another look |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@JamesDeFabia - looks good.
Jirabot Action Result: |
Type of change:
Checklist:
Smoketest:
Testing:
https://github.com/JamesDeFabia/github-action-dev-build/actions/runs/11862422016/artifacts/2194787675