Monday, 2015-02-02

*** ricardoamaro (~ricardoam@drupal.org/user/74228/view) has joined #wikid10:13
*** ricardoamaro has quit (Ping timeout: 240 seconds)10:37
*** ricardoamaro (~ricardoam@drupal.org/user/74228/view) has joined #wikid10:38
*** ricardoamaro has quit (Ping timeout: 264 seconds)10:48
*** ricardoamaro (~ricardoam@drupal.org/user/74228/view) has joined #wikid10:49
*** ricardoamaro has quit (Remote host closed the connection)11:03
*** ricardoamaro (~ricardoam@drupal.org/user/74228/view) has joined #wikid11:03
*** joevano has quit (Ping timeout: 264 seconds)14:05
*** joevano (~joevano@bzflag/developer/JoeVano) has joined #wikid14:05
*** nowen (~nowen@99-174-92-191.lightspeed.tukrga.sbcglobal.net) has joined #wikid14:28
laszlofhttps://plus.google.com/u/0/107271417393296791459/posts/P4DkYp9xbu9?pid=6111240239850136322&oid=10727141739329679145914:31
laszlofgot a little snow14:31
nowenwoah15:11
laszlofya.. 16" in 24 hours15:13
laszlof3rd biggest snow storm in history for this area15:13
laszlofoffice is empty today15:14
laszlofhaha15:14
nowendo you have 4wd?15:32
laszlofno, I have a car.15:32
laszlofa small car15:32
laszlofdriving to the office was fun15:32
laszlof:)15:32
*** ricardoamaro has quit (Quit: Leaving.)17:57
*** AccentureDan (3f7c1664@gateway/web/freenode/ip.63.124.22.100) has joined #wikid18:48
AccentureDanhey nick18:48
AccentureDanquick question18:48
nowenhey AccentureDan18:48
nowenok18:48
AccentureDanseeing some weird errors in WiKID18:48
AccentureDan2015-02-02 10:40:16.869ERRORcom.wikidsystems.client.wClientERROR: java.io.IOException: failed to decrypt safe contents entry: javax.crypto.BadPaddingException: Given final block not properly padded18:48
AccentureDanthoughts?18:49
nowenwhat changed?18:49
AccentureDanwell, after we finished with our auto-failover scripting...the WiKID servers remain up and in an active-passive pair...but we have been holding sessions to make sure users can request OTPs from WiKID18:50
AccentureDanwith a bunch of users trying to request OTPs around the same time18:50
AccentureDanthe WiKD service becomes hung18:50
AccentureDani have to restart the services on the WiKID master server to get it to produce OTPs again18:51
nowenwhat do you mean by 'holding sessions'?18:51
AccentureDancom.mchange.v2.resourcepool.BasicResourcePool@736d1c2f -- an attempt to checkout a resource was interrupted, and the pool is still live: some other thread must have either interrupted the Thread attempting checkout!18:51
AccentureDangetting this as well18:51
nowenwhat version of WiKD?18:51
AccentureDanholding sessions where we get 20-30 of the people who are registered to retry requesting one time passcodes18:51
AccentureDanjust in case they forgot their PIN or password for the token client18:51
nowenok18:51
AccentureDan4.0 build 0-b180318:52
AccentureDani know we need to upgrade haha18:52
AccentureDanhave you seen these issues before?18:52
nowenit is most likely the memory leak we fixed in b181718:53
AccentureDangotcha...so think that is related to the hanging we are seeing?18:53
nowencan you run 'locate java.security' and post or email the one that is not in /opt/WiKID?18:53
AccentureDansure18:53
AccentureDan[root@pdlptoinf04 WiKID]# locate java.security /opt/WiKID/WiKIDbackup_111714/conf/templates/java.security /opt/WiKID/conf/templates/java.security /opt/WiKIDbackup_111714/conf/templates/java.security /opt/WiKIDbackup_12162014/conf/templates/java.security /usr/lib/jvm/java-1.6.0-openjdk-1.6.0.0.x86_64/jre/lib/security/java.security.rpmsave /usr/lib/jvm/java-1.6.0-openjdk-1.6.0.34.x86_64/jre/lib/security/java.security /usr/lib/jvm/j18:53
AccentureDansorry about that18:54
AccentureDanthat is what I am seeing18:54
AccentureDanwhen i run that18:54
AccentureDanon the master WiKID server18:54
nowenok, what is in usr/lib/jvm/java-1.6.0-openjdk-1.6.0.34.x86_64/jre/lib/security/java.security?18:54
nowenor run 'diff /opt/WiKID/conf/templates/java.security /usr/lib/jvm/java-1.6.0-openjdk-1.6.0.34.x86_64/jre/lib/security/java.security'18:55
nowenis there a difference in the files?18:55
AccentureDanlemme email those commands18:56
AccentureDanoutputs*18:56
AccentureDanokay sent18:58
nowenthe pool error and not being able to get OTPs should be fixed in the upgrade18:58
AccentureDanthis was what i got when i did the diff18:58
AccentureDangotcha so probably seeing an issue in this version. not related to the changes we made, which is what i thought18:58
nowenhow much RAM in each server?18:58
AccentureDanbrb one sec18:58
AccentureDanback19:03
AccentureDan65 GB19:04
AccentureDanper machine19:04
AccentureDandont ask LOL19:04
nowenof RA<?19:04
AccentureDanyep19:04
laszlofchrist19:04
AccentureDani know LOL19:04
AccentureDanthey were repurposed Oracle X3-2's19:04
laszlofoverkill much?19:05
AccentureDanbeyond overkill19:05
AccentureDandont even ask about available storage LOL19:05
laszlof24PB?19:05
laszlof;)19:05
AccentureDanLMAO19:05
nowenok - run 'cp /opt/WiKID/conf/templates/java.security /usr/lib/jvm/java-1.6.0-openjdk-1.6.0.34.x86_64/jre/lib/security/java.security'19:05
AccentureDan1 TB, close enough :)19:05
AccentureDanwill do19:05
nowenand 'wget http://wikidsystems-dl.com/wikid-server-enterprise-4.0.1.b1821-1.noarch.rpm'19:05
nowenand 'rpm -Uvh wikid-server-enterprise-4.0.1.b1821-1.noarch.rpm'19:06
nowenon both servers and restart19:06
nowenseems like it would take a long time for a memory leak to get noticed19:06
AccentureDanokay downloading19:07
AccentureDanwill have to update either tonight or tomorrow after these trainings are done...should be fun...you should see the amount of work we put in to making sure these master and slave servers auto-failover19:13
AccentureDanits disgusting19:13
AccentureDanobviously customized for our environment and contractual obligations19:13
nowenno doubt19:14
AccentureDanas promised i will annotate our changes and forward on what we did...we created a solution for failing over, utilizing crontab, and made some modifications to the code itself...including using a master script...all running from a secondary server with a separate instance of WiKID19:16
nowencool19:16
*** AccentureDan has quit (Ping timeout: 246 seconds)19:32
*** nowen has quit (Quit: Leaving.)22:23

Generated by irclog2html.py 2.11.0 by Marius Gedminas - find it at mg.pov.lt!