FR Prod instance down #9596

Closed
audez opened this issue Aug 26, 2022 · 11 comments
Labels
bug-s2 The bug is affecting any of the non-critical features described in S1 and there is no workaround.

Comments

@audez
Collaborator

audez commented Aug 26, 2022

Description

Website completely down.
Then got the snail when trying to access any report at https://coopcircuits.fr/admin/reports

@audez audez added bug-s1 The bug is stopping the platform from working, and there is no workaround. Impact of lot of users. and removed bug-s1 The bug is stopping the platform from working, and there is no workaround. Impact of lot of users. labels Aug 26, 2022
@audez
Collaborator Author

audez commented Aug 26, 2022

This is OK now. The bug lasted 1 to 2 hours and was reported at 9:55 this morning:

  • snail on reports
  • importing orders to the integrated cash software (Pastèque) failed
  • every page was very slow to load (it was already slow yesterday)

It looks like it might be linked to our previous memory issue.

@audez audez added the bug-s2 The bug is affecting any of the non-critical features described in S1 and there is no workaround. label Aug 26, 2022
@audez
Collaborator Author

audez commented Aug 26, 2022

Ongoing work:
#9398
#9413

@RachL
Contributor

RachL commented Aug 26, 2022

@audez the snail did not occur only on reports but on all pages, right?

On Kuma we can see it's a server timeout:

image

Note we had the same downtime yesterday.

Datadog confirms the first spike occurred at 9:49, with some requests already starting at 9:39:

image

I haven't had time yet to see which request started the chain reaction. Perhaps if someone who has ssh access can access the logs, that would be quicker.

I'll be back on Monday, but if we need to throw more metal at it in the meantime, our credit card is stored in our host account. You can reach me on Telegram or Signal if there is a 2FA problem.

@audez
Collaborator Author

audez commented Aug 26, 2022

Thanks!
Yes, you're right, the user who reported at 9:55 confirms that everything was down.
For me the snail occurred only when trying to access a report; I tested around 11:30. I was able to access orders (but it was veeery slow) and all pages were slow. Yesterday everything was slow too. I think I loaded some reports around 12:30 and got no snail, though.

@audez audez changed the title Impossible to access reports CoopCircuits.fr down Aug 26, 2022
@audez
Collaborator Author

audez commented Aug 26, 2022

Everything was down again around 15:15. I had just done a superadmin enterprise search, so it might be due to this.
Edit: and down again at 15:30
Screen Shot 2022-08-26 at 15 23 58

It's confirmed by a Datadog spike:
Screen Shot 2022-08-26 at 15 25 42

@audez audez added bug-s1 The bug is stopping the platform from working, and there is no workaround. Impact of lot of users. and removed bug-s2 The bug is affecting any of the non-critical features described in S1 and there is no workaround. labels Aug 26, 2022
@RachL
Contributor

RachL commented Aug 26, 2022

@audez did you log in as super admin or as an enterprise user?

@audez
Collaborator Author

audez commented Aug 26, 2022

in a window as super admin and in another enterprise user
Just now, without being logged in, I couldn't reach coopcircuits.fr; it was down again.
Now it seems OK but very slow.

@mkllnk
Member

mkllnk commented Aug 29, 2022

> in a window as super admin and in another enterprise user

If it's in the same browser then you are logged in with the same account.

@mkllnk mkllnk assigned mkllnk and unassigned mkllnk Aug 29, 2022
@RachL RachL added bug-s2 The bug is affecting any of the non-critical features described in S1 and there is no workaround. and removed bug-s1 The bug is stopping the platform from working, and there is no workaround. Impact of lot of users. labels Aug 29, 2022
@audez
Collaborator Author

audez commented Aug 29, 2022

@mkllnk yeah, thanks :P It was the same browser, but one private window and one normal window, so I could test both. But the result was actually the same whether you were logged in as superadmin, as an enterprise user, or not logged in at all.

@mkllnk
Member

mkllnk commented Aug 30, 2022

Probably best to plan downtime between 2am and 7am French time. That should cover my workday very well.

@audez audez changed the title CoopCircuits.fr down Instance down Aug 30, 2022
@sigmundpetersen sigmundpetersen changed the title Instance down FR Pord instance down Aug 31, 2022
@sigmundpetersen sigmundpetersen changed the title FR Pord instance down FR Prod instance down Aug 31, 2022
@RachL
Contributor

RachL commented Sep 15, 2022

Closing for now. Let's see if we need to reopen after openfoodfoundation/ofn-install#825
