The problem is in the CES memory associated with the event-filter formatter Currently it is (almost always) necessary to recycle the power of the VME crate of the affected processor to fix the problem. Since during the run that will stop the taping, this fix is best done when tapes are changed. Do the following:
> cesd 03100000 8000 B 4741 4D4D 4153 5049 4552 4521 03100010 0 8181 0 8181 0 8181 8D5 1 03100020 0 3 6425 3665 0 1 FFFF FFFF 03100030 8000 F 4741 4D4D 4153 5049 4552 4521 03100040 27BD 103 0 1 0 1 0 1 03100050 0 8181 0 8181 0 8181 BF2 1 03100060 0 3 6425 371F 0 1 FFFF FFFF 03100070 8000 F 4741 4D4D 4153 5049 4552 4521 03100080 27BF 103 0 1 0 1 0 1 03100090 0 8181 0 8181 0 8181 155 1 031000A0 0 3 6425 37D9 0 1 FFFF FFFF 031000B0 8000 F 4741 4D4D 4153 5049 4552 4521 031000C0 27BC 103 0 1 0 1 0 1 031000D0 0 8181 0 8181 0 8181 45D 1 031000E0 0 3 6425 3895 0 1 FFFF FFFF 031000F0 8000 F 4741 4D4D 4153 5049 4552 4521 Randy pointed out where the stuck bit was: > > cesd > 03100000 8000 B 4741 4D4D 4153 5049 4552 4521 > 03100010 0 8181 0 8181 0 8181 8D5 1 > 03100020 0 3 6425 3665 0 1 FFFF FFFF > 03100030 8000 F 4741 4D4D 4153 5049 4552 4521 > 03100040 27BD 103 0 1 0 1 0 1 Stuck bit: 1 1 1 1 > 03100050 0 8181 0 8181 0 8181 BF2 1 > 03100060 0 3 6425 371F 0 1 FFFF FFFF 1 1 1 1 > 03100070 8000 F 4741 4D4D 4153 5049 4552 4521 1 1 1 1 > 03100080 27BF 103 0 1 0 1 0 1 1 1 1 1
On 2/11/98 a new version of the dump program was installed and conditions have to be really bad before this program gives up getting the EFF spectra. However, not all utilities have been upgraded (that is a big job) so you may still have stop or slow down the acquisition as described above.
eff _effcontrol 70d024 210 SUSPEND d0d0d8 70cf74 0 0then you may be able to simply resume the task by typing:
tr 0x70d024where "70d024" is the number in the third column (the 0x in front signifies that it's a hex number). If the EFF task comes alive, you can use the processor again.
-> Access Fault Program Counter: 0x00d0d0e0 Status Register: 0x3004 Access Address : 0x0510000c Special Status : 0x0085 Task: 0x70d024 "eff"In that case the next step is to reboot the processor. You can do that by typing "reboot" in the tip window or push the red reset button on the front of the processor.
The [1]germanium high temperature monitor, [2]ln-alarm monitor and [3]VXI crate monitors will send alarms to members of the
/home/gam13/acct/gs/.BEEPERADDRlist and the [4]ln-fill monitor is using the list
/home/gam13/acct/gs/.LNFILMONHere is an example of how the .BEEPERADDR might look like:
[email protected] [email protected] [email protected]In this case, two beepers are notified when there is an alarm. One E-mail to a regular account (via a mail alias) is sent as well.
Here is an example of how the .LNFILMON file might look:
[email protected] [email protected]People on this list will get the 'ln fill started', 'ln fill done' and 'tank fill done' information messages every time the GS array is filled.
press store press either 1, 2 or 3 enter number press storeYou should hear a one second beep when the number is stored. It isn't really necessary to touch or change the message of the dialer -- it doesn't get through to the alpha numeric beepers anyway. On these beepers, the dialer generates a "020202" message or the like.
The dialer can be programmed for three numbers. You select those in the second line.
to disable a number: press store, press either 1, 2 or 3, press store. I.e., do NOT enter a number. Don't put in 0 since it will dial the operator!!!
The LN dial wires should connect to the outermost contacts of the alarm inputs.
NOTICE: Remember to put 4 in front of the beeper number on the dialer (E.g., my beeper is 44026 on the dialer!!!)
We are not sure what causes this hang-up; but the fix is simple: put the cursor in any of the trigger condition input windows and press return. That will cause the trigger maps to be reloaded in the MTM and data will start flowing again.
Check on Darek, he sometimes forgets to restart the mrmto program...
# This script is executed at the end when the # EFF processors boot # set BGO honeycomb time limits bgo_sum_time_lo=3204 bgo_sum_time_hi=3720 # set lo res sum energy time limits ge_sum_time_lo=3840 ge_sum_time_hi=4160The germanium limits are set with the variables: 'ge_sum_time_lo' and 'ge_sum_time_hi'. The BGO time limits are set by the variables: 'bgo_sum_time_lo' and 'bgo_sum_time_hi'.
The values in the "user/eff/user.eff" file take effect when you boot the EFF processors. If you want to change the limits without rebooting, you must type these lines in ALL the six EFF processor tip windows. (E.g, simply type "bgo_sum_time_lo=3204" in a tip window.) You can also see what the current limits are by typing the above names without values. (Try typing "bgo_sum_time_lo" - it should come back with the value the EFF has at the moment).
If you set the time limits based on spectra dumped by the EFFs you must take into account the contraction these spectra have:
ge times: /8 BGO times: /4The time limits you specify for the EFF code must be for uncontracted times.
tc /dev/rmt/1mbn /dev/rmt/0mbn 0 112 90 16384That will copy the tapeheader (siz=112), the file header (siz=90) and all data blocks (siz=16384) from the input tape, /dev/rmt/1mbn, to the output tape, /dev/rmt/0mbn. If the last block is truncated, it will not be copied either (we don't want to!). It doesn't matter if there is a double end of file mark on the input tape or not -- tc will do the right thing.
tc /dev/rmt/1mbn /dev/rmt/0mbn 0 90 16384i.e, we copy as before but we exclude the tape header. Daphne gets very perplexed if it sees two of those on one data tape...
mt -f /dev/rmt/0mbn weof
That should generate a compressed tape that Daphne will not choke on. However, this solutions does not work directly if you need to split an input tape on two output tapes - we'll have to think about how to do that properly.
Also, DAPHNE is being modified to be able to handle imperfect tapes, so this problem should go away soon.
As of late March/98, this should no-longer be a problem
This is a bug in the EPICS version we are running. Simply kill the EPICS windows you have up and start the gs commander again.
This is an X problem - which is not understood at the moment. The fix is to log out from the console and log in again. That resets whatever is causing the problem -- and you can again control the experiment.
This variable must match the delay that you have set in the FERA/VXI board for the EFF code to understand and recognize external data.
The variable is set when you boot the EFF processors when the file:
/home/gam13/acct/gs/user/eff/eff.useris executed. Look for a line like:
fera_delay=0x7c00This word is constructed from:
01dd|dd00|0000|0000where dddd is the dip-switch setting. (bit 14 is always 1 for reasons that are less than obvious). For the two typical dip-switch settings we use:
dip switch binary word hexword delay used 1001 0110|0100|0000|0000 0x6400 (30 usec delay) 1111 0111|1100|0000|0000 0x7c00 (50 usec delay)If you change the dip-switch value you must 1) modify the eff.user and 2)
fera_delay=0x7c00
> ping -s gamvxi01 PING ....: 56 data bytes 64 bytes from ..........: icmp_seq=0. time=895. ms 64 bytes from ..........: icmp_seq=1. time=1981. ms 64 bytes from ..........: icmp_seq=2. time=984. ms 64 bytes from ..........: icmp_seq=3. time=1279. ms 64 bytes from ..........: icmp_seq=4. time=284. mswhereas you should see a response like:
> ping -s gamvxi02 PING .....: 56 data bytes 64 bytes from ..........: icmp_seq=0. time=5. ms 64 bytes from ..........: icmp_seq=1. time=5. ms 64 bytes from ..........: icmp_seq=2. time=6. ms 64 bytes from ..........: icmp_seq=3. time=5. ms 64 bytes from ..........: icmp_seq=4. time=12. ms 64 bytes from ..........: icmp_seq=5. time=4. ms 64 bytes from ..........: icmp_seq=6. time=5. ms 64
We are not sure what causes this problem; but rebooting the processor fixes the problem. Please note that crate 4 is notorious for not getting the UNIX time and thus sometimes needs to be rebooted many times before it comes up. The master crate occasionally has the same problem; but it usually gets the time in a couple of tries.
This can happen because the EFF's may still have some old data stored and if you are not careful about the 'tape on/off' there may be some old data in the 64MB memories as well that gets flushed (or rather pushed) when new data comes in.
To avoid this do:
pulse_con 1,011 (for highest rate) pulse_con 1,012 pulse_con 1,013 (for lowest rate)Then go to the scripts directory and set the amplitudes:
pulser_amp crate/all value(typical values bt 1 and 5).
To turn the pulsers off again, type
pulse_con 0in the master console window and use the pulser script to turn the amplitudes down again. (The latter action is probably not necessary).
Problem: The problem is the SCSI terminal server.
Solution: It is necessary to cycle the power on the terminal server. Since it is a SCSI device, this should be done when con1 is not running or the internal disk in the machine could be damaged. Thus, do
EV len cln dty bgo ttH ttM ttL tac1 tac2 sge sbgo ** 24 04 05 15 0014 27079 16015 1814 2545 0304 0982 ev# id ehi g p o tge ebgo tbgo elo esid tc bgohitp tRF hs 0 078 00576 1 0 0 3938 0000 0000 0000 0074 0000 0000000 2393 0 1 096 01809 1 0 0 3982 0000 0000 0000 0000 0000 0110101 2437 1 2 045 01007 1 0 0 3983 0000 0000 0000 0000 0000 0000000 2438 0 3 085 01708 1 0 0 3943 0000 0000 0000 1745 0000 0000000 2398 0 4 059 05056 1 0 0 3996 0026 1968 0000 0000 0000 0100000 2451 0 5 071 02571 1 0 0 3990 0241 2030 0000 0339 0000 0010100 2445 0 6 079 00496 1 0 0 3993 0908 2058 0000 0022 0000 0000011 2448 0 7 105 00419 1 0 0 4013 0258 2048 0000 0000 0000 0001000 2468 0 8 109 00899 1 0 0 3958 0000 0216 0000 0000 0000 0000010 2413 0 9 030 00000 0 0 0 0000 0045 0216 0000 0000 0000 0001000 -1545 0 10 040 00000 0 0 0 0000 0132 2035 0000 0000 0000 0001000 -1545 0 11 048 00000 0 0 0 0000 0272 0777 0000 0000 0000 1000000 -1545 0 12 050 00000 0 0 0 0000 0076 2028 0000 0000 0000 0001000 -1545 0 13 056 00000 0 0 0 0000 0153 2027 0000 0000 0000 0001000 -1545 0 14 086 00000 0 0 0 0000 0060 1974 0000 0000 0000 0011000 -1545 0 15 092 01856 0 0 0 0000 0276 2051 0000 0000 0000 1000000 -1545 0 16 094 00000 0 0 0 0000 0354 2049 0000 0000 0000 0100000 -1545 0 17 108 00000 0 0 0 0000 0156 2037 0000 0000 0000 0000010 -1545 0 18 027 00000 0 0 0 0000 0356 2049 0000 0000 0000 0000110 -1545 0 19 043 00000 0 0 0 0000 0289 2043 0000 0000 0000 0011000 -1545 0 20 063 00000 0 0 0 0000 0194 1882 0000 0000 0000 0100000 -1545 0 21 087 00000 0 0 0 0000 0086 0212 0000 0000 0000 0000010 -1545 0 22 095 00000 0 0 0 0000 0263 2054 0000 0000 0000 0000001 -1545 0 23 097 00000 0 0 0 0000 0203 2047 0209 0000 0000 0100000 -1545 0 <----how can event 23, which is supposed to be a clean bgo, have an elow value?
The energy in the germanium was probably so high that it was overrange on the high res adc, thus it was not classified as a dirty germanium as it really should have been.
m 0xffffc632in the master tip window. It should respond as:
ffffc632: 0000-at this point type
ffffc632: 0000-10and hit return. It will move to the next memory location and display something like
ffffc634: 00ff-type . and hit return to get out of the program.
SrcTTLtrig -1,3,0,0and to set LE mode
SrcTTLtrig -1,3,1,0You must execute these commands in each of the 6 VXI console windows
<1usecand to set 2 usec timing, excute
<2usecYou must execute these commands in each of the 6 VXI console windows