lava-devel July 2019

lava-devel@lists.lavasoftware.org

3 participants
6 discussions

Design meeting minutes from 2019-07-31

by Remi Duraffort

Hi folks, We held our regular design meeting today via Hangout. Summary of brief discussion: # 31st July 2019 # Using celery inside LAVA [Rémi] Could be used for: * Parsing description.yaml * Replace crontabs * Compressing logs * Removing old jobs * Sending logs to ES * Splitting scheduling * notifications # August release ? [Steve] Yes - aim for 28th The release will have database changes Tag the day before to let some time for lavafed to test the tag # LAVA sessions for Connect [Steve] Three sessions has been accepeted: * LAVA Users' Forum * Hacking and contributing to LAVA * Advanced testing in python # Support packages in Debian removed/being removed [Steve] * lava-tool removed (https://tracker.debian.org/pkg/lava-tool) * django-compat marked for removal ( https://tracker.debian.org/pkg/django-compat) * django-hijack depends on django-compat, marked for removal too ( https://tracker.debian.org/pkg/django-hijack) * Rémi will remove the dependency/mention * VLANd needs some rework to go to [python 3]( https://git.lavasoftware.org/lava/vland/issues/5) # Migrating to django 2.2 (next LTS release) [Milosz] * lava source code itself is compatible with django 2.2 * Dependencies on filters and rest framework * Still waiting for a release compatible with django 2.0 * Trying to contact the maintainer to give some help (if needed) # gitlab-runner on aarch64 [Steve] The current package in [Debian]( https://tracker.debian.org/pkg/gitlab-ci-multi-runner) was a bit too old. GitLab don't provide anything for arm64 but the Debian maintainer uploaded a new version that fixe our issues. ============================================================================ The LAVA design meeting is held weekly, every Wednesday at 13:00 to 14:00 UTC using Google Hangouts Meet: https://meet.google.com/qre-rgen-zwc Feel free to comment here or join us directly in the meeting. Minutes from this and previous meetings are also stored in the LAVA wiki: https://git.lavasoftware.org/lava/lava/wikis/design-meetings/index Cheers, -- Rémi Duraffort LAVA Team, Linaro

6 years

Design meeting minutes from 2019-07-24

by Remi Duraffort

Hi folks, We held our regular design meeting today via Hangout. Summary of brief discussion: # 24th July 2019 # install.git-deps [milosz] This feature works nicely: https://lkft.validation.linaro.org/scheduler/job/834097 Proposal: keep `install` option but restrict it so it’s not trying to install system packages. [Rémi] Will submit a patch to remove the “deprecation” warning in the documentation. # Authentication refactoring [milosz] Under review by Remi. Looks good. # Connect sessions where accepted [Rémi] * LAVA users forum * Hacking and contributing to LAVA * Advanced testing in python # Playing with Sentry error reporting [Rémi] * Will create a ticket to have it installed in the linaro lab. * Will create sentry.lavasoftware.org * No debian package available for python3-sentry-sdk * Should be installed from pip (sentry-sdk) * Will send a patch to install sentry-sdk from pip in lava-server docker container. * Activate it for lavafed instances. ============================================================================ The LAVA design meeting is held weekly, every Wednesday at 13:00 to 14:00 UTC using Google Hangouts Meet: https://meet.google.com/qre-rgen-zwc Feel free to comment here or join us directly in the meeting. Minutes from this and previous meetings are also stored in the LAVA wiki: https://git.lavasoftware.org/lava/lava/wikis/design-meetings/index Cheers, -- Rémi Duraffort LAVA Team, Linaro

6 years

Design meeting minutes from 2019-07-17

by Remi Duraffort

Hi folks, We held our regular design meeting today via Hangout. Summary of brief discussion: # 17th July 2019 # Large job definitions causing outages [deanb] * Issue: https://git.lavasoftware.org/lava/lava/issues/299 * Wondering if for large jobs (configurable limit) simply not making ActionData objects is a sensible approach. * Tried this: https://git.lavasoftware.org/dean-birch/lava/commit/dd220c0bd82bf092e35e643… * In my test instance this reduced outage to 30 seconds (from hours). * If not, what else can we do? * Anything extra needs to be added? * Documentation? * [deanb] will send a patch with the first improvements (CLoader) * [deanb] will look at using bulk save to save all objects in one call * [stevan] investigate ActionData: is it possible to create them later on or even maybe not creating them? # Test from inline with git [milosz] The idea is to source test-definition YAML from inline but use git repository to prepare overlay. Example: https://github.com/andersson/bootrr Bjorn doesn’t want to have YAML file in this repository [Rémi] Using install.git-repos might work * https://git.lavasoftware.org/lava/lava/blob/master/lava_dispatcher/tests/te… * https://docs.lavasoftware.org/lava/lava_test_shell.html#adding-git-bzr-repo… [milosz] will try install.git-repos If that’s working Rémi will add some tests in lavafed or meta-lava # Switching between serial connections on device with multiple UARTs [Malcolm Brooks] * Issue: We have devices which use separate serial outputs for MCC, AP and SCP UARTs. * Workaround: Use the `new_connection` boot method to switch between UART1 and UART2 in order to catch the kernel booting once MCC flash stage is complete. * Idea: Allow all connections (or possibly a subset defined in the “connection_tags” for example) to be established and followed from the beginning of the job, and allow each action/stage to select which they are actually listening/interacting with the `connection` option (example below). ```yaml - boot: namespace: target connection: uart1 method: minimal ``` [Rémi] Sounds like a good idea. * Using feedback LAVA can already use one connection and listen/print the other ones * Malcom will create an issue on gitlab. ============================================================================ The LAVA design meeting is held weekly, every Wednesday at 13:00 to 14:00 UTC using Google Hangouts Meet: https://meet.google.com/qre-rgen-zwc Feel free to comment here or join us directly in the meeting. Minutes from this and previous meetings are also stored in the LAVA wiki: https://git.lavasoftware.org/lava/lava/wikis/design-meetings/index Cheers, -- Rémi Duraffort LAVA Team, Linaro

6 years

Large job definitions in LAVA

by Dean Birch

Hi All, We've previously had an issue on our LAVA instance where it stopped responding to workers and stopped dispatching jobs when it finished running large job definition (around 25000 lines in the definition, around 1000 deploy/boot/test actions). I've been looking into reproducing this safely in a development environment, and I've got a few observations and questions about how the situation could be improved. The lava-master process appears be stuck processing the job results, and takes a painstakingly long time to finish this and send an ACK for END_OK. During this processing, the master doesn't respond to worker pings, and doesn't schedule other jobs. Tracking a bit deeper, it seems that the vast majority of time (I've never seen it finish as I have always restarted the lava services before it finishes) in the walk_actions and build_action functions of the lava_results_app/dbutils.py file: https://git.lavasoftware.org/lava/lava/blob/2019.05.post1/lava_results_app/… https://git.lavasoftware.org/lava/lava/blob/2019.05.post1/lava_results_app/… What options is there to mitigate this issue? Some ideas below: - Could we optimize the build_action function? There are a few Django model/db queries in build_action, could some results be queried once and cached? With an obscenely large job, would this even give us enough savings to make the time invested in safely optimizing this worth it? - What are the implications of not having created ActionData objects for a job? Does this mean that no options will be available in the "Pipeline ↓" drop-down on the job page for quick navigation? Could we optionally abort after a certain amount of these (and make it configurable per LAVA instance)? - Should/could the handling of the results be forked off, so lava-master can continue to schedule more jobs and respond to worker pings, but slowly the ActionData objects can be populated? I'm unsure if you have to be on a special thread to write to Django models. Even if this could be done, would any weird behaviours occur on the slave side as it will still be waiting for the ACK for END_OK from the master? Any guidance on how to proceed with this would be appreciated! I'm happy to place this and some more details in as a LAVA issue on git.lavasoftware.org if this is easier to track and discuss. Thanks, Dean

6 years, 1 month

No design meeting for 2019-07-10

by Steve McIntyre

Hi folks, As Rémi and Stevan are both out and we don't have any items listed for discussion in advance, I'm cancelling today's design meeting. Cheers, -- Steve McIntyre steve.mcintyre(a)linaro.org <http://www.linaro.org/> Linaro.org | Open source software for ARM SoCs

6 years, 1 month

Design meeting minutes from 2019-07-03

by Steve McIntyre

Hi folks, We held our regular design meeting today via Hangout. Summary of brief discussion: 3rd July 2019 1. [Rémi] 2019.07 release? 1. Should we do one? 2. Most people will be out for most of the month 3. Maybe worth it for the LITE group (depends on the number of patches). 4. Steve back from DebConf on the 28th, but… 5. Yes: releasing around the 18th 2. [Rémi] Debian buster is due soon 1. Basing the docker image on Buster? 2. No, wait for a little bit. Maybe 2019.08? 3. Staging is already running Buster, main v.l.o is still on Stretch but the lab team will want to upgrade soon 4. How long do we support stretch-backports? 5. Add buster-backports soon, as new uploads will hit Debian unstable (---> Bullseye). 6. Target 2019.08 at all three releases (stretch-backports, buster-backports, bullseye) 3. [Rémi] Recommendations about VACUUM ANALYZE 1. This should be run regularly (every day) on busy instances to clean up 2. Add a thing in the docs, test in the lab 3. See https://www.postgresql.org/docs/11/sql-vacuum.html for more info - does a VACUUM then ANALYZE without the old data. 4. Lets the DB self-optimise for performance 4. [Rémi] Using git submodule to include docker sources into lava sources 1. Still a separate repository 2. The exact commit hash used for the lava docker image is now known and reproducible. 1. This is the main reason 2. Using version.py on the last commit of the docker directory can also work. 3. See https://git.lavasoftware.org/lava/lava/merge_requests/637 4. Let's go with this instead of git submodule, it works fine ============================================================================ The LAVA design meeting is held weekly, every Wednesday at 13:00 to 14:00 UTC using Google Hangouts Meet: https://meet.google.com/qre-rgen-zwc Feel free to comment here or join us directly in the meeting. Minutes from this and previous meetings are also stored in the LAVA wiki: https://git.lavasoftware.org/lava/lava/wikis/design-meetings/index Cheers, -- Steve McIntyre steve.mcintyre(a)linaro.org <http://www.linaro.org/> Linaro.org | Open source software for ARM SoCs

6 years, 1 month

2025

2024

2023

2022

2021

2020

2019

2018

lava-devel July 2019