feocco/bbLogs

bb-access log is parsed unintentionally

feocco opened this issue · 2 comments

The bb-access log does not need to be formatted but it fits the naming convention used for our bb logs.

We should exclude the access logs from parsing to reduce the time taken for parsing folders. As the access log HTML is an empty file but often the largest log. Therefore, takes the most amount of time to go through every line.

Can fix in fileFactory.py. This is how we grab files now.

def getBbFiles(self): dirFiles = [self.directory + '\\' + f for f in os.listdir(self.directory) if re.search(r'[b]{2}-\w+-log.*.txt', f)]

Fixt: fed1454