"Copy" command in command prompt generates 0x1a artifact?


  1. Posts : 40
    Two soup cans and some string.
       #1

    "Copy" command in command prompt generates 0x1a artifact?


    Hello Everybody!

    I don't know if this is a real issue, or just one of those "What the . . . .!!" situations. You judge.

    Situation:
    I was working on a batch file that combines two .txt (ASCII?) files created in Notepad to create a composite file, and I noticed something really interesting. (And yes, when someone talks about doing something and the result is "really interesting", it's time to duck and hunt cover! )

    Viz.:
    Assume that there are two text files: "This.txt" and "That.txt", and they contain the text "This[cr-lf]" and "That[cr-lf]", where [cr-lf] is used to indicate a newline generated by hitting the "Enter/Return" key on the keyboard.

    If you look at the files with a hex editor, what you see is something like this:
    The file "This.txt" contains the text "This" followed by 0x0D0A, which are the ascii codes for CR and LF.
    The file "That.txt" contains the text "That" followed by the same CR and LF characters.
    Note that these two files contain nothing else.

    Issue I discovered:

    • When I execute the command copy /y ".\This.txt"+".\That.txt" ".\This&That.txt", I do not get the simple concatenation of the two files, which is what I would expect. Instead I get the two files concatenated, (including the trailing CR/LF's as I would expect), but also one additional character, the hex character for a right-arrow symbol - 0x1A. In essence I get the following:
      This[0x0D0A]
      That[0x0D0A1A]
    • If I perform the same copy, and signal that the copy is an "ascii" copy, (copy /a /y), I get the exact same result, including the trailing 0x1A.
    • If, and only if, I signal the copy as a "binary" copy, (copy /b /y), do I get the two files copied together without the trailing 0x1A.

    I have absolutely no idea what is going on here, or why this happens.

    I have tested it on two different machines, one running Win7 64 bit Professional on a domain, and another system running Win7 64 bit Home Premium on a Toshiba laptop.

    I am attaching two ZIP archives:

    • The first, titled "this&that.zip", contains the three files: This.txt, That.txt. and the batch file "this&that.bat" that performs the three copy operations noted above.
    • The second, titled "this&that results.zip" has the three output files "this&that.txt", "this&that_ascii.txt", and "this&that_binary.txt"
    • I am also including two screen snips showing the ascii and hex results for the "this&that.txt" file.


    You can open the three result files in either a hex editor, or notepad, to see what I see. If you use a hex editor, you will see the trailing 0x1A on the first two files. If you open them in Notepad, you will see a small right-arrow character at the very bottom of the first two files.

    I do not know if this is a "bug" or a "feature", ( ), as I would expect either a "simple" copy, or (especially!), an ASCII copy to not generate spurious artifacts.

    If someone could help me understand what is happening here, and (even better), why it's happening this way, I would gratefully appreciate it.

    Thanks again for all your help!

    Jim (JR)
    Attached Thumbnails Attached Thumbnails "Copy" command in command prompt generates 0x1a artifact?-ascii.jpg   "Copy" command in command prompt generates 0x1a artifact?-hex.jpg  
    "Copy" command in command prompt generates 0x1a artifact? Attached Files
    Last edited by jharris1993; 02 Mar 2015 at 17:01. Reason: Grammer (Doh!)
      My Computer


  2. Posts : 1,519
    Windows 7 Ultimate 64-bit, Windows 8.1 64-bit, Mac OS X 10.10, Linux Mint 17, Windows 10 Pro TP
       #2

    I was trying to remember from my DOS days but drew a blank. A Google Search brought up a possible solution:
    ms dos - In MS DOS copying several files to one file - Stack Overflow
      My Computer


  3. Posts : 40
    Two soup cans and some string.
    Thread Starter
       #3

    Berton said:
    I was trying to remember from my DOS days but drew a blank. A Google Search brought up a possible solution:
    ms dos - In MS DOS copying several files to one file - Stack Overflow
    Berton,

    You saw everything I saw, and that article doesn't say anything new that you or I don't already know.

    These instructions go all the way back to DOS 4 and 5, maybe even earlier. Unfortunately, I don't remember this kind of artifact happening before, and I have tried concatenating text files. Maybe I'm just loosing my grip, or what. I sure don't know.

    Maybe someone else out there in Television Land has an idea?

    Maybe I need to bring up XP in a virtual machine and try it?

    Maybe I should get out more often? ( :laughing: )

    What say ye?

    Jim (JR)
      My Computer


  4. Posts : 2,468
    Windows 7 Ultimate x64
       #4

    Look at here:
    Batch script to merge files without Hex char 1A at the end - Stack Overflow

    Apparently, the position of the switched matters, because in your samples, only the inputs are considered binary, while the output is still plain text. This causes the output of the end-of-file character (0x1A).
      My Computer


  5. Posts : 40
    Two soup cans and some string.
    Thread Starter
       #5

    Alejandro85 said:
    Look at here:
    Batch script to merge files without Hex char 1A at the end - Stack Overflow

    Apparently, the position of the switched matters, because in your samples, only the inputs are considered binary, while the output is still plain text. This causes the output of the end-of-file character (0x1A).
    Alejandro,

    Thanks for an excellent link! +1 for you.

    The "help" (/? switch) for the copy command gives this:
    COPY [/D] [/V] [/N] [/Y | /-Y] [/Z] [/L] [/A | /B ] source [/A | /B]
    [+ source [/A | /B] [+ ...]] [destination [/A | /B]]
    . . . and yes, you can specify the format of an individual file by appending the appropriate switch after it. However, (silly me!), I "assumed" that the ability to specify either /A or /B before any of the file names (see emphasis in red), meant that this would be applied globally. (i.e. Specifying either /A or /B before any of the file names means that the switch should apply to all of the files in the list.)

    Also, the post you mention indicates that the EOF (0x1A) should be added after a binary file but not after an ascii file. In my case, it is only by selecting "binary" as a global file type that I avoid the added characters.

    I do not want to sound like I am trying to discredit you, or your efforts. They are appreciated. Unfortunately this article seems to contradict what is actually happening.

    And I am still puzzled.

    What say ye?

    Jim (JR)
      My Computer


  6. Posts : 721
    Windows 10, Windows 8.1 Pro, Windows 7 Professional, OS X El Capitan
       #6

    I say ditch the copy command altogether and instead use type along with redirection to concatenate txt files.

    E.g.
    Code:
    (type This.txt & type That.txt) > "This&That.txt"
      My Computer


  7. Posts : 5,642
    Windows 10 Pro (x64)
       #7

    Could also use Powershell.
    Code:
    Get-Content .\this.txt,.\that.txt | Out-File .\this-that.txt
      My Computer


 

  Related Discussions
Our Sites
Site Links
About Us
Windows 7 Forums is an independent web site and has not been authorized, sponsored, or otherwise approved by Microsoft Corporation. "Windows 7" and related materials are trademarks of Microsoft Corp.

© Designer Media Ltd
All times are GMT -5. The time now is 19:29.
Find Us