I am dealing with the processing of a large amount (several TB) of Outlook PST files. Unfortunately, these PSTs contain many Bloomberg chat messages that have been stored in the PSTs as emails, with no special formatting. Some of these Bloomberg chat messages are causing processing to fail. The issue seems to be that the message BODY contains a ton of chat recipients, all separated by the pipe character ( | ). We're talking tens of thousands of recipients. And so I need a way to identify the offending messages and move them. These Bloomberg emails appear to make up at least half or more of the PSTs.
First Requirement:
- Subject contains "Bloomberg Message" OR "Bloomberg_Message"
Second Requirement:
- Identify messages where the text contains more than 210 instances of the pipe character ( | ). These present the recipients of the Bloomberg chats within the text of the messages, rather than the headers. There, we find thousands of recipients, separated by the pipe character ( | ). So counting the pipes and moving messages with more than X number seems a good solution, if possible.
For messages meeting these criteria, the goal will be to move them from whatever folder they reside in to another new folder.
Here's a sample of the message text [edited to remove personal information], which goes on to list more than 25,000 recipients. Again, this is contained in the BODY of the message, rather than the Outlook address fields.
Screen Shot 2019-04-19 at 8.59.02 AM.jpg
Thanks!