[Cuis-dev] [DEFECT] Can't zip Cuis' sources file
Gerald Klix
cuis.01 at klix.ch
Thu Nov 23 07:28:57 PST 2023
Hi Juan,
thanks a lot for your efforts.
I will test it as soon as I have checked in my latest work.
Did I get this right, this sample text ('ლორემ იფსუმ დოლორ')
is Georgian, isn't it? I wonder what it means.
Best Regards,
Gerald
On 11/23/23 3:17 PM, Juan Vuletich wrote:
> Hi Gerald,
>
> The simple fact is that ZIP files don't store "strings", they only
> store bytes. There were a few places in the Compression.pck.st that
> still made the old assumption that Strings and ByteArrays were "the
> same". I fixed those I found. I also added #addBytes:as:, that is what
> you usually want.
>
> Now, the following examples work:
>
> ZipArchive new
> zipFileComment: 'ZIP file test';
> addString: 'ლორემ იფსუმ დოლორ' as: 'someText.bin';
> writeToFileNamed: 'someText.zip'.
>
> ZipArchive new
> zipFileComment: 'ZIP file test';
> addBytes: 'ლორემ იფსუმ დოლორ' asUtf8Bytes as: 'someTextBytes2.bin';
> writeToFileNamed: 'someTextBytes2.zip'.
>
> Additionally, this example also works:
>
> ZipArchive new
> zipFileComment: 'ZIP file test';
> addFile:
> 'Cuis-Smalltalk-Dev/CoreUpdates/5922-FixedClassMethodPushDown-TomásSpognardi-2023Jul09-20h38m-TS.001.cs.st'
> as: 'testOutx.bin';
> writeToFileNamed: 'testbinx.zip'.
>
> So, I guess the problem you had with this file got also fixed along
> the way.
>
> All 3 zip files seem to unzip properly on my Mac.
>
> Please test again, and keep reporting further problems.
>
> Cheers,
>
> On 11/22/2023 8:13 AM, Gerald Klix via Cuis-dev wrote:
>> Hi Juan,
>>
>> I got tried to work around this problem by reading the
>> license files in binary mode and converting them to a string.
>> However this leads to as CRC error:
>>
>> extracting: LICENSES.txt bad CRC 46294e2b (should be
>> c42f8133)
>>
>>
>> As promised I skimmed the ZIP file spec at
>> https://pkware.cachefly.net/webdocs/casestudies/APPNOTE.TXT
>> I only saw provisions for UTF-8 file names.
>> Indeed we have problems storing such files; adding
>> this one
>> https://github.com/Cuis-Smalltalk/Cuis-Smalltalk-Dev/blob/master/CoreUpdates/5922-FixedClassMethodPushDown-Tom%C3%A1sSpognardi-2023Jul09-20h38m-TS.001.cs.st
>> yields the following error when listing the contents withunzip -l
>>
>> 1195 2023-07-19 20:39
>> Image/CoreUpdates/5922-FixedClassMethodPushDown-TomásSpognardi-2023Jul09-20h38m-TS.001.cs.s
>> error: expected central file header signature not found (file #459).
>> (please check that you have transferred or created the zipfile in the
>>
>>
>> *The biggest problem however is this one:
>>
>> *Execute the following code snippet:
>>
>> 'testIn.bin' asFileEntry forceWriteStreamDo: [ :ws |
>> ws binary.
>> ws nextPutAll: #[16rF1 16rF7] ].
>>
>>
>> ZipArchive new
>> zipFileComment: 'ZIP file test';
>> addFile: 'testIn.bin' as: 'testOut.bin';
>> writeToFileNamed: 'testbin.zip'
>>
>> Then unzip the file with:
>>
>> bear at speedy ~/s/c/Environments> unzip testbin.zip
>> Archive: testbin.zip
>> ZIP file test
>> inflating: testOut.bin
>>
>> This yields the following:
>>
>> bear at speedy ~/s/c/Environments> ls -l test*.bin
>> -rw-r--r-- 1 bear bear 2 Nov 22 12:01 testIn.bin
>> -rw-rw-rw- 1 bear bear 4 Nov 22 12:01 testOut.bin
>>
>> Invoking hexdump on both files shows a "slight" difference:
>>
>> bear at speedy ~/s/c/Environments> hexdump testIn.bin
>> 0000000 f7f1
>> 0000002
>> bear at speedy ~/s/c/Environments> hexdump testOut.bin
>> 0000000 b1c3 b7c3
>> 0000004
>>
>> Obviously some UTF8 conversion happened.
>> Needless to say that ZipArchive can't be used to compress binary
>> files and thus I can't create a distribution of Haver
>> with my current tool-set.
>>
>>
>> Best Regards,
>>
>> Gerald
>>
>>
>>
>> On 11/22/23 10:21 AM, Gerald Klix via Cuis-dev wrote:
>>> Thank's Juan,
>>>
>>> this works now.
>>>
>>> Alas when I tried to create a new Haver release I ran into another
>>> problem.
>>> I also add strings to the ZIP-archive, for example a generated file
>>> that contains
>>> all the contents of all the license files found. The resulting
>>> string contains
>>> Unicode code-points, which makes the CRC computation fail.
>>>
>>> In essence when sending #updateCrc:from:to:in: the primitive fails –
>>> for
>>> obvious reasons – and the failure code kicks in. This in turn
>>> (indirectly) sends #bitXor:
>>> to a UnicodePoint instance, which results in a DNU exception.
>>>
>>> I will try to do some research on how UTF8 data is supposed
>>> to be stored in a ZIP file.
>>>
>>>
>>> Thank's again and Best Regards,
>>>
>>> Gerald
>>>
>>>
>>>
>>> On 11/21/23 9:12 PM, Juan Vuletich wrote:
>>>> On 11/14/2023 10:20 AM, Gerald Klix via Cuis-dev wrote:
>>>>> Hi all, Hi Juan,
>>>>>
>>>>> The following snippet does not terminate (within reasonable time,
>>>>> 20h):
>>>>>
>>>>> (z1 := ZipArchive new) zipFileComment: 'ZIP test'.
>>>>> z1
>>>>> addFile: SourceFiles first fileEntry as: 'f1';
>>>>> writeToFileNamed: 'test.zip';
>>>>> close.
>>>>>
>>>>> Adding small test files, created like this: echo test > test.txt ,
>>>>> however does work:
>>>>>
>>>>> (z3 := ZipArchive new) zipFileComment: 'ZIP test'.
>>>>> z3
>>>>> addFile: Smalltalk imageName as: 'f3';
>>>>> writeToFileNamed: 'test.zip';
>>>>> close.
>>>>>
>>>>>
>>>>>
>>>>> Best Regards,
>>>>>
>>>>> Gerald
>>>>
>>>> Hi Gerald,
>>>>
>>>> I just pushed a fix to GitHub. Thanks for reporting.
>>>>
>>>> Cheers,
>>>>
>>>
>>
>
>
More information about the Cuis-dev
mailing list