Solved Lazy man's question about file copying

teckneeculler

Member
Power User
VIP
Local time
4:11 AM
Messages
512
Just finished setting up a new W7 system and I want to selectively copy a large number of Word docs.

My new system's main drive is a 240Gb SSD (which I'm trying to keep uncluttered) and I've mounted three other regular SATA HDDs that contain archived collections of documents.

An 'Everything' search of my system reveals around 30,000 Word docs that are stored on the three support drives.

Many docs are duplicates. For example, because of past backups, and subsequent backups of backups, there might be 4 or 5 copies of file 'XYZ.doc' in these archives.

My aim is to locate just ONE copy of each duplicated doc and copy it to a new folder on my SSD.

Question: Does anyone know of software that could make this process as simple as possible?

Obviously, I can manually locate the files with 'Everything' or Windows search, but I don't want to have to deal with every file individually. Even if there are 5 duplicates of each file, there are still 6,000 files to locate and copy.

Hey, what I'm asking might be impossible, but if you don't ask, you don't get :-)
 

My Computer My Computer

Computer type
PC/Desktop
Computer Manufacturer/Model Number
My build
OS
Win7 Ultimate SP1
CPU
Intel Core i5 9400 Coffee Lake 14nm
Motherboard
Asus Prime H310M-E R2.0 (LGA1151)
Memory
16G DDR4
Graphics Card(s)
NVidia GeForce GTX 960
Sound Card
Realtek High Definition Audio (mobo)
Monitor(s) Displays
Asus PA248 24" 16:10 format
Screen Resolution
1920x1200@59Hz
Hard Drives
1863GB Seagate ST2000DM
3726GB Seagate ST4000DM
1863GB Seagate ST2000DM
6TB Seagate
465GB NVMe Samsung SSD 970
PSU
750G2
Case
Tower
Cooling
Standard
Keyboard
USB
Mouse
USB
Internet Speed
920Mbs/480Mbs
Antivirus
Malwarebytes
Browser
Firefox
I'm not sure you can do what you want in only one step.

One approach would be to copy all the files from your various folders into one folder (on one of your support drives) and then run a duplicate finder program to keep only one copy.

Another is to use a program such as SyncBack to copy the various folder contents, one folder at a time, into the receiving folder, which should result in only one copy of each unique file name.

I use SyncBack quite often and it is relatively easy to use. I think this approach would probably be quicker than the first.

Good luck.
 

My Computer My Computer

Computer type
PC/Desktop
OS
Windows 7 Ultimate x64
Here's what I've done in a similar situation.

I had 25,000 mp3s scattered across hundreds of artist subfolders. I wanted to get them all into one folder---mixing the Johnny Cash in with the Elvis and Sinatra, with NO subfolders at all.

I used the Everything search engine, which you already have. There are other ways, but it's the quickest method I've found.

Here's what I think should work in your case--you may have to modify it, but you'll get the idea:

Make a folder called "temp" on some drive that has enough free space to hold all of these files, including the duplicates.

Open Everything and do a search for .doc. I assume this will bring up 100% of your desired Word documents, including all the duplicates.

Highlight all of them in the Everything results window. Right click the highlighted files and choose "copy".

Navigate to the "temp" folder, right click and choose "paste". All of those files you copied in the Everything results will be copied into this folder, without any subfolders. It might appear to stall, but give it time. It might take 10 minutes or an hour or more, but the files should copy.

All files with distinct names are copied and at the end of that process you are asked if you want to overwrite those with the same names. You say, NO, copy and retain both files. The file shown earliest in the Everything list will retain its original name and the others later in the list with identical names will be named with (2), (3), etc extensions.

You'll end up with file names like xyz.doc, with the duplicates named xyz(1).doc, xyz(2).doc, xyz(3).doc and so forth.

You'd then want to keep only those without the parenthetical ending on the file name--deleting all the dupes. Leaving only xyz.doc.

You could do that with a dupe file finder or through the command line. The command line would be quickest. Navigate to the temp folder and enter a command such as "del *(1).doc". Then "del *(2).doc", etc until only those that had no parentheses remain.
 

My Computer My Computer

Computer type
PC/Desktop
Computer Manufacturer/Model Number
Ignatz Special; 4 speed manual gearbox; factory air conditioning; one of one
OS
Windows 7 Home Premium SP1, 64-bit
CPU
Intel Skylake i5-6600K, not overclocked
Motherboard
AsRock Z170M Extreme 4, micro ATX
Memory
8 GB HyperX DDR4-2666 (2 x 4 GB)
Graphics Card(s)
none; graphics are integrated on CPU
Sound Card
onboard: Realtek ALC1150; external: USB Behringer UF0-202
Monitor(s) Displays
Dell S2340M 23 inch IPS
Screen Resolution
1600 x 900
Hard Drives
System: Crucial MX100 series SSD, 128 GB;
Data: Samsung Spinpoint 103SJ, 1 TB;
Backup: WD Caviar Green WD30EZRX-00D8PB0, 3 TB
PSU
Rosewill SilentNight 500 watt fanless, semi-modular
Case
Antec Solo II
Cooling
Noctua NH-U12S; Noctua F12 intake, Noctua S12A exhaust
Keyboard
Microsoft 200 6JH-00001 USB
Mouse
Dell or Microsoft optical wired; USB
Antivirus
Microsoft Security Essentials and Malwarebytes Premium
Browser
Pale Moon
Other Info
All fans PWM; speeds at idle: CPU circa 500 rpm; intake circa 600 rpm; exhaust circa 600 rpm; CPU temps 27 idle and 47 C load in a warm room (27 C/81 F) when running Intel Extreme Tuning Utility stress test.
br4, thanks, excellent advice.

Re SyncBack, I haven't heard of that, but I'll check it out.

Your first suggestion sounds best, I have Duplicate File Finder somewhere in my archives.

But as I type this, a possible glitch occurs to me. Or maybe it's a possible advantage...

It's this: I wouldn't be able to copy all of those 30,000 docs into one folder, because Windows wouldn't allow the duplicates to exist alongside each other. Does that sound right? Or am I overlooking something obvious?

Edit: Oops, didn't see your post, ignatzatsonic. Thanks, also. I'll give that some thought today.
 

My Computer My Computer

Computer type
PC/Desktop
Computer Manufacturer/Model Number
My build
OS
Win7 Ultimate SP1
CPU
Intel Core i5 9400 Coffee Lake 14nm
Motherboard
Asus Prime H310M-E R2.0 (LGA1151)
Memory
16G DDR4
Graphics Card(s)
NVidia GeForce GTX 960
Sound Card
Realtek High Definition Audio (mobo)
Monitor(s) Displays
Asus PA248 24" 16:10 format
Screen Resolution
1920x1200@59Hz
Hard Drives
1863GB Seagate ST2000DM
3726GB Seagate ST4000DM
1863GB Seagate ST2000DM
6TB Seagate
465GB NVMe Samsung SSD 970
PSU
750G2
Case
Tower
Cooling
Standard
Keyboard
USB
Mouse
USB
Internet Speed
920Mbs/480Mbs
Antivirus
Malwarebytes
Browser
Firefox
I am not sure if this is what IGNAT said above but if you copy files from one source to another, Windows will ask "If you want to Overwrite or Keep Duplicates" and if you want to do this every time. Just check the Overwrite and it will do it every time and should be easy.
 

My Computer My Computer

Computer type
PC/Desktop
Computer Manufacturer/Model Number
Custom Build
OS
Windows 7 Home Premium 32-Bit - Build 7600 SP1
CPU
Intel Core i3-2120 3.30Ghz
Motherboard
Asus P8Z68-V LX Intel Z68 Socket H2 ATX
Memory
Kingston 4 GB DDR3 1333 mhz
Graphics Card(s)
AMD Radeon HD6670
Sound Card
Sound Blaster Audigy SE 24-Bit
Monitor(s) Displays
Asus VE228
Screen Resolution
1440 X 900
Hard Drives
OCZ Vertex 3 120 GB Sata 3 SSD ==
Kingston SH103/S3 120 G Hyper X 120 GB SSD ==
Western Digital 500 GB Caviar Green 7200 RPM ==
PSU
Corsair CX600M == 600 Watt
Case
NZXT Apollo - Silver with Clear Side Panel
Cooling
Three 120 mm Fans
Keyboard
Microsoft Natural 4000
Mouse
Microsoft Custom Optical 3000
Internet Speed
AT&T Fiber Optic Wireless Network
Antivirus
Microsoft Security Essentials
Browser
Chrome
Other Info
120 mm Blue LED Fan -- Three Blue LED Lazer Light Sticks
The 'copy and paste' approach will certainly work, but then you'll have to remove the duplicate files.

Using SyncBack will result in only one copy for each unique file name, each being the latest modified time.

Let me know if I can help.
 

My Computer My Computer

Computer type
PC/Desktop
OS
Windows 7 Ultimate x64
br4: Are you using SyncBack free or Pro?
 

My Computer My Computer

Computer type
PC/Desktop
Computer Manufacturer/Model Number
My build
OS
Win7 Ultimate SP1
CPU
Intel Core i5 9400 Coffee Lake 14nm
Motherboard
Asus Prime H310M-E R2.0 (LGA1151)
Memory
16G DDR4
Graphics Card(s)
NVidia GeForce GTX 960
Sound Card
Realtek High Definition Audio (mobo)
Monitor(s) Displays
Asus PA248 24" 16:10 format
Screen Resolution
1920x1200@59Hz
Hard Drives
1863GB Seagate ST2000DM
3726GB Seagate ST4000DM
1863GB Seagate ST2000DM
6TB Seagate
465GB NVMe Samsung SSD 970
PSU
750G2
Case
Tower
Cooling
Standard
Keyboard
USB
Mouse
USB
Internet Speed
920Mbs/480Mbs
Antivirus
Malwarebytes
Browser
Firefox
Thanks, br4. Ok, I have a copy of 7.5.19.0 portable.

How do I use it to selectively copy my docs?
 

My Computer My Computer

Computer type
PC/Desktop
Computer Manufacturer/Model Number
My build
OS
Win7 Ultimate SP1
CPU
Intel Core i5 9400 Coffee Lake 14nm
Motherboard
Asus Prime H310M-E R2.0 (LGA1151)
Memory
16G DDR4
Graphics Card(s)
NVidia GeForce GTX 960
Sound Card
Realtek High Definition Audio (mobo)
Monitor(s) Displays
Asus PA248 24" 16:10 format
Screen Resolution
1920x1200@59Hz
Hard Drives
1863GB Seagate ST2000DM
3726GB Seagate ST4000DM
1863GB Seagate ST2000DM
6TB Seagate
465GB NVMe Samsung SSD 970
PSU
750G2
Case
Tower
Cooling
Standard
Keyboard
USB
Mouse
USB
Internet Speed
920Mbs/480Mbs
Antivirus
Malwarebytes
Browser
Firefox
In SyncBack you'll be creating one or more profiles, depending how you want to do the copying, and how the files are stored on the drives (folder-wise).

Since you have files on three drives, I'd recommend creating three profiles, one for each drive. Within each profile you can set up a filter to copy only .docx files.

You can create one profile for each separate sourcing folder, or have it a a higher level. The higher the level, the more files SB has to go through, but that might not be an issue.

Be sure to create a new folder for the resulting copied files. SB does not delete any of the copied files, so nothing to worry about there.

I recommend you message me with any questions and I'll be happy to help.
 

My Computer My Computer

Computer type
PC/Desktop
OS
Windows 7 Ultimate x64
In SyncBack you'll be creating one or more profiles, depending how you want to do the copying, and how the files are stored on the drives (folder-wise).

Since you have files on three drives, I'd recommend creating three profiles, one for each drive. Within each profile you can set up a filter to copy only .docx files.

You can create one profile for each separate sourcing folder, or have it a a higher level. The higher the level, the more files SB has to go through, but that might not be an issue.

Be sure to create a new folder for the resulting copied files. SB does not delete any of the copied files, so nothing to worry about there.

I recommend you message me with any questions and I'll be happy to help.

It helps everyone if questions are answered on the public forum.
 

My Computer My Computer

Computer type
PC/Desktop
Computer Manufacturer/Model Number
Custom Build
OS
Windows 7 Home Premium 32-Bit - Build 7600 SP1
CPU
Intel Core i3-2120 3.30Ghz
Motherboard
Asus P8Z68-V LX Intel Z68 Socket H2 ATX
Memory
Kingston 4 GB DDR3 1333 mhz
Graphics Card(s)
AMD Radeon HD6670
Sound Card
Sound Blaster Audigy SE 24-Bit
Monitor(s) Displays
Asus VE228
Screen Resolution
1440 X 900
Hard Drives
OCZ Vertex 3 120 GB Sata 3 SSD ==
Kingston SH103/S3 120 G Hyper X 120 GB SSD ==
Western Digital 500 GB Caviar Green 7200 RPM ==
PSU
Corsair CX600M == 600 Watt
Case
NZXT Apollo - Silver with Clear Side Panel
Cooling
Three 120 mm Fans
Keyboard
Microsoft Natural 4000
Mouse
Microsoft Custom Optical 3000
Internet Speed
AT&T Fiber Optic Wireless Network
Antivirus
Microsoft Security Essentials
Browser
Chrome
Other Info
120 mm Blue LED Fan -- Three Blue LED Lazer Light Sticks
It depends! Windows often has duplicates in different directories as do other programs and apps. So there's no blanket rule for deleting duplicate files even some text and doc files have the same name (and size) and may not be duplicates. I use a program called dupe trasher but I use filters for files txt doc xlx etc and get it to match on name size and date so that I know that its almost a 100% duplicate but it's still tedious to check each one before deleting.
Good luck
 

My Computer My Computer

Computer type
PC/Desktop
Computer Manufacturer/Model Number
HP
OS
32bit
Thanks everyone for your inputs. I think I have enough info now to start a doc cleanup, so I'll mark the thread 'solved'.

Cheers, all :-)
 

My Computer My Computer

Computer type
PC/Desktop
Computer Manufacturer/Model Number
My build
OS
Win7 Ultimate SP1
CPU
Intel Core i5 9400 Coffee Lake 14nm
Motherboard
Asus Prime H310M-E R2.0 (LGA1151)
Memory
16G DDR4
Graphics Card(s)
NVidia GeForce GTX 960
Sound Card
Realtek High Definition Audio (mobo)
Monitor(s) Displays
Asus PA248 24" 16:10 format
Screen Resolution
1920x1200@59Hz
Hard Drives
1863GB Seagate ST2000DM
3726GB Seagate ST4000DM
1863GB Seagate ST2000DM
6TB Seagate
465GB NVMe Samsung SSD 970
PSU
750G2
Case
Tower
Cooling
Standard
Keyboard
USB
Mouse
USB
Internet Speed
920Mbs/480Mbs
Antivirus
Malwarebytes
Browser
Firefox
Back
Top