OWG Meeting 1-Jul-2015


Location and Time

Room: CC F326-327

Time: 1:30pm-2:30pm

Connection

You can connect using BlueJeans video conferencing (ID: 120390084):

(If you have problems, call the phone in the conference room: 757-269-6460.)

  1. To join via Polycom room system go to the IP Address: 199.48.152.152 (bjn.vc) and enter the meeting ID: 120390084.
  2. To join via a Web Browser, go to the page https://bluejeans.com/120390084.
  3. To join via phone, use one of the following numbers and the Conference ID: 120390084
    • US or Canada: +1 408 740 7256 or
    • US or Canada: +1 888 240 2560
  4. More information on connecting to bluejeans is available.


Previous Meeting


Agenda

  1. Announcements
  2. Power Outage
  3. "Too many Windows" and RHEL6 (or 7?)
  4. RCDB deployment
  5. Major systems reports
    • Controls
    • DAQ


Minutes

Attendees: David L. (chair), Simon T., Sergey F., Mahmoud K., Bryan M., Sean D., Curtis M., Beni Z., Vardan G.

Announcements

  • Paul L. was installing RHEL7 on gluon46 today at Dave A.'s request so that he can try using our system to track down issues with CODA on that platform
    • gluon46 still has issues with spontaneous rebooting so this exercise should help determine whether this is due to hardware or software
    • a separate disk is used for this so we can easily switch back to RHEL6
  • A cleanup of the hdops home directory was done today. See David's log entry for details.

Power Outage

  • Since the last meeting we have had the long power outage and recovery. No major issues were reported regarding the counting house systems except for an issue with gluon24 that Hovanes noted in a logbook entry. Hovanes was not at the meeting, so David will follow up with him later to find out the current status.

"Too many Windows" and RHEL6 (or 7?)

  • David reported having spent some time with Paul Letta this morning trying to debug this issue.
    • A script was written to open 100 terminal windows and then close them. Initially, this reproduced the error. However, after logging out and then back in, the problem did not recur. A time factor may be involved.
    • Paul suggested we could try using KDE instead of GNOME.
    • Hall-B is rumored to have implemented a solution for a similar problem. David will check with Sergey B. for details.
    • The problem could be related to multiple computers using the same home directory. We tried temporarily giving gluon04 a local home directory for hdops. The script did not produce the error there, but when we went back to the original configuration it did not produce the error either, so we cannot conclude anything from that test.
    • The last-resort solution would be to upgrade to RHEL7. No one has reported this problem on gluon01, which is running RHEL7. However, it has not been used for running the DAQ, which seems to be the most common place where we see this error.
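
The script used for the test is not recorded in these minutes. As a rough illustration only, a stress test of the kind described (open many terminal windows, then close them) might look like the following minimal sketch. The function name `stress_windows`, the use of `xterm` as the terminal program, and the hold time are all assumptions, not details of the actual script.

```python
import subprocess
import time


def stress_windows(count=100, command=("xterm",), hold_seconds=2.0):
    """Launch `count` copies of `command`, wait, then close them all.

    Hypothetical helper: the terminal program and the timing are
    assumptions, not taken from the script used in the actual test.

    Returns (launched, failed), where `failed` counts processes that
    could not be started or that exited on their own before we closed
    them (for example because the window could not be opened).
    """
    procs = []
    failed = 0
    for _ in range(count):
        try:
            procs.append(
                subprocess.Popen(
                    list(command),
                    stdout=subprocess.DEVNULL,
                    stderr=subprocess.DEVNULL,
                )
            )
        except OSError:
            # Command not found, or a process/resource limit was hit.
            failed += 1

    # Leave the windows open briefly so any failure has time to show up.
    time.sleep(hold_seconds)

    for proc in procs:
        if proc.poll() is not None:
            failed += 1       # already exited without being asked to
        else:
            proc.terminate()  # ask the window to close
    for proc in procs:
        proc.wait()           # reap every child process
    return len(procs), failed


# Example call (assumes xterm is installed and an X display is available):
# launched, failed = stress_windows(100, ("xterm",))
```

Running such a script before and after logging out, or with a local versus a shared home directory, would give a repeatable count of failed windows to compare between configurations.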

RCDB Deployment

  • Sean inquired about RCDB deployment. He is interested since the offline monitoring program will need to be modified to use it.
  • David and Sergey reported that development is basically done, but it needs to be deployed with an outward-facing web server and a DB that the web server can see (i.e., not just running in the counting house).
  • The optimal solution would be to have Dmitry on site to help with the deployment. However, there continue to be bureaucratic issues with having him visit JLab as he has done in the past. Sergey relayed that the latest information he had heard was that Dmitry would not be allowed to book a trip here before the end of August at the earliest.
  • In order to get the system deployed early enough for the offline monitoring group to fully integrate and test it prior to the Fall run, we will need to work on this without having Dmitry at JLab. Sergey and David will pursue what is still needed for the web server and DB server to be set up with proper replication from inside the counting house to outside. They will report on progress at the next Online meeting in 2 weeks.

Major Systems Reports

Controls

  • No representative from Controls was present at the meeting again. David will ask if they plan to continue participation or limit themselves to the separate Controls meeting.

DAQ

  • No significant DAQ testing has been done since the last meeting due to the power outage.
  • Vardan asked that we let the CODA group know if we will be doing any significant DAQ testing so that they can participate.

L3

  • Curtis asked about the state of Level-3. No significant progress was reported.