I've recently come up against an SQL file that's too big for SQuirreL SQL to handle... I keep getting a Java heap memory error when I try to paste it all in. The quick answer appears to be csplit. To break a giant file into chunks of 10,000 lines each, you might be tempted to call this:

csplit tokenLinks.sql 10000 {100}    # don't use this one! (read on)

That says to split the file just before line 10,000 and then repeat that split 100 more times, i.e., chunk the SQL file into roughly 100 pieces of 10,000 lines each.
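For reference, csplit writes its pieces as xx00, xx01, and so on in the current directory (the prefix is changeable with -f). Here's a toy run on a throwaway 30-line file (my own example, not the real SQL) to show the numbering and the repeat syntax:

seq 30 > toy.txt
csplit toy.txt 10 {1}    # split before line 10, then repeat once more (line 20)
wc -l xx*                # xx00: lines 1-9, xx01: 10-19, xx02: 20-30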

The problem here is that your file may not actually have 100 groups of 10,000 lines. I used 100 because the line count changed from file to file, and I didn't want a bloated last file if I pitched low. That is, 15 groups of 10,000 lines isn't enough if the file's got 200k lines; the leftover ~50,000 lines would all land in the last piece.

So csplit gives you an "out of range" error if you shoot too high and then, get this, erases all the files it made along the way. Nice. So it's worthless, right?

Not so fast. From Unix Power Tools, [Chapter 35] 35.10 Splitting Files by Context: csplit:

Unfortunately, if you tell csplit to create more files than it's able to, this produces an 'out of range' error. Furthermore, when csplit encounters an error, it exits by removing any files it created along the way. (A bug, if you ask me.) This is where the -k option comes in. Specify -k to keep the files around, even when the 'out of range' message occurs.


csplit -k tokenLinks.sql 10000 {100}
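
And if you'd rather not overshoot at all, you can compute the repeat count from the file itself. A rough sketch, assuming a file longer than one 10,000-line chunk (GNU csplit also accepts {*} to mean "repeat until the input runs out", but I wouldn't bet on that flavor being everywhere):

lines=$(wc -l < tokenLinks.sql)
reps=$(( lines / 10000 - 1 ))    # repeats needed after the first split
csplit -k tokenLinks.sql 10000 "{$reps}"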

Happy and responsive. And, at least on my iBook running 10.4, the last file does have the last entry, so nothing's missed.
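
A quick way to double-check that, assuming csplit's default xx* names (add -n 3 if you expect more than 100 pieces, so the names still sort in order):

tail -n 1 tokenLinks.sql             # last line of the original
tail -n 1 "$(ls xx* | tail -n 1)"    # last line of the final piece; should match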
