User Details
- User Since
- May 11 2015, 8:31 AM (475 w, 2 d)
- Availability
- Available
- IRC Nick
- jynus
- LDAP User
- Jcrespo
- MediaWiki User
- JCrespo (WMF) [ Global Accounts ]
Today
I wasn't worried about immediate alerts. I know those won't change for now.
@ABran-WMF It will very likely change it, because, as you shared, the exporter does:
I don't see a clear difference with the current icinga/perl implementation.
I am also not going to enable any account until the end of the load (including monitoring) to avoid any bad interaction.
The process seems to have failed at the last steps. Retrying with a higher buffer pool and stopping s1.
Yesterday
I am leaving for the day, but there is a chance this is not worth debugging because the hosts are about to be decommissioned (unless it happens on the new ones, too). Filing it in case it could be useful for other perf issues for other hosts.
While technically the host didn't crash- it had an "unscheduled normal shutdown", given it is the source of s3 backups on eqiad, I am going to recover it from backups.
Mon, Jun 17
And this is the wiki distribution:
This is the API request I filed: T267365
@ABran-WMF As you can see, codfw health status is much better (I queried it just before restarting) ^
Fri, Jun 14
Done!
Deleted from zarcillo and stopped.
Wed, Jun 12
The alerts should be configurable by lag and by role from puppet- that means: I don't want alerts for backup sources lagging < 4h, as I regularly stop those while taking the backups. E.g. core db hosts vs misc vs test hosts, etc.
db1205 is the secondary media backups metadata db server, usually just a standby to db1204. Unless it is the active server because the primary is unavailable, the only check needed is that replication restarts correctly after maintenance.
backup1011 is a mediabackups storage server. Ideally, mediabackups are paused during the maintenance to avoid backup errors.
backup1009 is the main backup node for bacula on eqiad. Most backups happen during the night- so just monitoring that it came back and new backups happen normally would be enough.
backup1010 is in intermittent use to support mediabackups disk space, but mostly idle at the moment, so unless its situation changes by July and it finally gets pooled for bacula, it will require no action.
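The role-aware lag alerting described above could be sketched roughly like this (a hypothetical illustration only, not the actual puppet/alerting code; the role names and thresholds are made up for the example):

```python
# Hypothetical sketch: pick a replication-lag alert threshold per host
# role, so backup sources that are intentionally stopped for hours while
# backups run don't page. All numbers here are illustrative.
THRESHOLDS_SECONDS = {
    "core": 300,                # page quickly for core db hosts
    "misc": 900,
    "backup_source": 5 * 3600,  # backups stop replication for hours
    "test": None,               # never page for test hosts
}

def should_alert(role: str, lag_seconds: float) -> bool:
    """Return True if the measured replication lag warrants an alert for this role."""
    threshold = THRESHOLDS_SECONDS.get(role)
    return threshold is not None and lag_seconds > threshold
```

With a mapping like this, a backup source lagging 3h stays quiet while a core host lagging 10 minutes pages.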
@Marostegui, in order to resolve this ticket, now that I assume read activity is lower, do you think I could get a host from es4 and es5 on both dcs depooled for a day, with exclusive usage, in order to take a final, archivable, full backup of those sections? It doesn't have to happen at the same time on the 4 hosts:
@ABran-WMF Thanks for handling it. To confirm, the issue happened at 2024-06-11 13:53:41 (Tuesday), right (or was it earlier)? Because I may recover the host from backups just to be 100% sure there is no leftover corruption.
Fri, Jun 7
This is ready for dc-ops.
This is ready for dc ops.
This is ready for dc ops.
Tue, Jun 4
Then I mistook the ops user for the ops db, sorry.
ops is the user under which the query killer events & logs run. If you drop it, the events will fail and the dbs will get overloaded, as usually happens when the events for a db haven't been loaded properly.
Wed, May 29
I will migrate the backups to 10.6 without removing yet the 10.4 backup sources.
@Volans not Amir, but re: your first question, my understanding is that this was a compromise to make sure there was something good enough and simple short term, rather than overengineering from the start. That doesn't mean that what you suggest is discarded, just that it could be improved later on. For example, I am personally interested in having a queryable service/API for backup checks later, but this is better than nothing ATM, with relatively small effort. Later on, a database could import the file and generate it, for example. So I am a fan of iterating slowly as long as each step is an improvement 0:-D.
Thu, May 23
Wed, May 22
Followup to T361087.
I did a disk stress test for an hour or so and saw no media errors, SMART errors or RAID controller weirdness.
Resolving for now. A disk was rebuilt on the 17th of May:
Tue, May 21
- Stop es4 and es5 backups
- Generate a full clusterX and clusterY last backup
- Archive it into long term backups
- Remove dump user
May 14 2024
May 13 2024
Thanks, the upgrade is no issue, but the data will have a lot of backup errors due to not being depooled before maintenance, so it will need some work.
May 9 2024
All backups will now be generated from 10.6 servers, with the exception of s1. Leaving a couple of hosts on 10.4 before upgrading/decommissioning them.
@Marostegui es6 and es7 backups are enabled, and a first run was done here. They seem mostly empty, though:
May 7 2024
May 6 2024
It was failing back in 2021:
Here are the 2 file versions (the hashes confirm they are the same file):
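Comparing two files by hash, as done above, can be sketched with a short script (a generic illustration, not the tooling actually used here; any hash mismatch means the contents differ):

```python
import hashlib

def file_digest(path: str, algo: str = "sha256") -> str:
    """Hash a file in chunks so large backup files don't need to fit in RAM."""
    h = hashlib.new(algo)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()

def same_content(path_a: str, path_b: str) -> bool:
    """True if both files have byte-identical contents (same digest)."""
    return file_digest(path_a) == file_digest(path_b)
```

Printing the two digests side by side makes the comparison easy to archive alongside the files.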
Apr 30 2024
Apr 25 2024
In any case, at this point I'd prefer to do an in-place upgrade rather than a reimage, given how unreliable a reimage is and how impactful it can be for stateful services.
If booted into bullseye.