¿ªÔÆÌåÓý

Re: Rescrapeing Queries


 

On Thu, Nov 14, 2019 at 03:59 PM, J Forster wrote:
One of my groups with about 70,000 messages was transferred and the after action report showed perhaps a score of messages not transferred because of Ysahoo issues.
J -- With metrics like that, I believe you should thank your lucky stars!

I've looked up the message numbers and they are present on Yahoo still. I believe it's possible to retransfer just the messages.
The message numbers do not map 1:1, largely because some messages in the Y!G were probably deleted long before you ever asked for a transfer. Did you actually read the messages left behind in the Yahoo Group and confirm that they are not present in the transferred data??

Another source of missing messages can occur if you allow posts to continue in your Yahoo Group after the transfer agent has already completed the scraping. (A quaint term -- never heard of that before)

Queries:

Is it necessary to delete the entire message file on Groups.io?
Yes, unless you're willing to meticulously go through every message and delete 70,000 dupes. Deleting all those messages will itself be rather burdensome, to put it mildly. Please consider that a repeat transfer could run into even more Yahoo glitches and leave you with a worse situation than you already have.

What happens to the messages received by Groups.io in the interim between group startup and the new transfer?
They are retained, and these are the ones with low message numbers.

At the moment, those messages seem to be assigned a new number series series, starting at #1.
Message numbers are assigned in sequence by groups.io as they arrive, whether by new posts or via the transfer agent.

After? the rescraping and reload, would the #1 to #5 be lost?
If you don't delete them first, they will be retained, along with every other message currently in your message archive.

Hope this helps,
Bruce

Join [email protected] to automatically receive all group messages.