CMS DAS Pre-Exercises

General Information

CMS is a large collaboration of three thousand people, spread over several institutions across five continents. It is important to bring collaborators up to speed on the CMS detector and computing software so that they can contribute to physics data analysis. CMS keeps the WorkBook as up to date as possible to help guide its users. In addition, several tutorials (a.k.a. HATS) and workshops are held regularly to give users hands-on experience with the basic CMS software tools. These tutorials cover all aspects of CMS data analysis, from reconstructed physics objects (e.g. muons, electrons, photons, jets) to trigger, generators, statistics, machine learning, software development, and access to grid and computing resources. While HATS are usually offered over several weeks during late spring and early summer, CMS Data Analysis Schools (DASes) follow a boot camp model, concentrating all of the training in a very short period of time. Before the Covid-19 pandemic, DAS was held in person and lasted 5 days. During the pandemic it became fully remote and expanded to two weeks, with an asynchronous and a synchronous component. CMSDAS@LPC2023 will be the first in-person school since 2020.

To prepare for CMSDAS, participants are required to complete a series of pre-exercises on the laptop they will use during the school. The goal of the pre-exercises is to resolve all basic issues before the school starts, so that participants can hit the ground running.

Links to all the past and upcoming CMS Schools can be found HERE.

What: A series of pre-exercises that put all the needed tools to work on the laptop participants will bring to CMS DAS, so that they are ready to go from the beginning of the school.

Who: All participants of the CMS DAS schools.

Where: This training will take place online.

When: Asynchronously, from November 2022 to mid-December 2022.

Requirements: Participants must have access to a computer with internet access for which they have administrative privileges. Acceptable operating systems include Mac OS, Linux, or Windows (preferably not a tablet, Chromebook, etc.). The setup page will have more information about any additional pieces of software that must be installed or any accounts which must be obtained.

Accessibility: We are dedicated to providing a positive and accessible learning environment for all. Please notify the instructors in advance of the workshop if you require any accommodations or if there is anything we can do to make this workshop more accessible to you.

Instructors: Gabriele Benelli and Marguerite Tonjes

Helpers: Guillermo Antonio Fidalgo Rodriguez, Irene Zoi

Mattermost chat: The PreExercises channel will be available once you join the CMSDAS@LPC 2023 team. Directions on how to join this Mattermost chat team can be found on the setup page.

Contact: Please email cmsdasatlpc@fnal.gov for more information or assistance.

Prerequisites

Before going any further, please follow the instructions on the setup page.

Schedule

Setup  Obtain the necessary accounts (warning: may take 4 to 6 weeks)

00:00  1. CMS Data Analysis School Pre-Exercises - First Set
       How do you set up a CMSSW release?
       How do you find a dataset using the Data Aggregation System (DAS)?
       What are some EDM standalone utilities and what do they do?
       What is MiniAOD and how do you use it?

01:00  2. CMS Data Analysis School Pre-Exercises - Second Set
       How do you slim a MiniAOD file?
       How do you find the size of a MiniAOD file?
       How do you use FWLite to analyze data and MC?

01:30  3. CMS Data Analysis School Pre-Exercises - Third Set
       How do I do an analysis with so much data that I cannot run it interactively on my computer?
       What is CRAB? How do I use it to run an analysis on the grid?
       What do configuration files look like?
       How do I extract the luminosity of the dataset I analyzed?

05:30  4. CMS Data Analysis School Pre-Exercises - Fourth Set
       How do we analyze an EDM ROOT file using an EDAnalyzer?
       How do we analyze an EDM ROOT file using an FWLite executable?
       How do we use ROOT/RooFit to fit a function to a histogram?

06:30  5. CMS Data Analysis School Pre-Exercises - Fifth Set
       How do I set up Git on my computer/cluster?
       How do I collaborate using GitHub?

07:00  6. CMS Data Analysis School Pre-Exercises - Sixth Set
       What is Jupyter?
       What is PyROOT?

07:30  7. CMS Data Analysis School Pre-Exercises - Seventh Set
       What is an image? How about a container?
       What is Docker/Singularity?
       Why is containerization useful?
       Ummmm…how is this different from a virtual machine?

08:30  Finish

The actual schedule may vary slightly depending on the topics and exercises chosen by the instructor.