Differences between revisions 42 and 43
Revision 42 as of 2008-11-08 09:33:06
Size: 1133
Editor: r11bk55
Comment: -9
Revision 43 as of 2008-11-08 10:28:07
Size: 1733
Editor: pgas
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
[[Anchor(faq19)]]
== How can I split a file into line ranges, e.g. lines 1-10, 11-20, 21-30? ==
Some Unix systems provide the {{{split}}} utility for this purpose:
Line 2: Line 5:
---- /!\ '''Edit conflict - other version:''' ----
Perfect site! We're all thinking of you here and think your efforts are incredible! recipe for coho salmon Its very well ! http://redndo.8tt.org/recipe7983.html unofficeail recipe book winter, http://list21.bplaced.net/recipe8932.html california medley soup recipe water, http://rings10.bplaced.net/recipe8113.html chocolate candy cane cookies recipe felt, http://silver6.myd.net/recipe3234.html franks red hot sauce wing recipe, See you at the latest at Le Mans.
----
CategoryHomepage
{{{
    split --lines 10 --numeric-suffixes input.txt output-
}}}
Line 7: Line 9:
---- /!\ '''Edit conflict - your version:''' ----
Perfect site! We're all thinking of you here and think your efforts are incredible! recipe for coho salmon Its very well ! http://redndo.8tt.org/recipe7983.html unofficeail recipe book winter, http://list21.bplaced.net/recipe8932.html california medley soup recipe water, http://rings10.bplaced.net/recipe8113.html chocolate candy cane cookies recipe felt, http://silver6.myd.net/recipe3234.html franks red hot sauce wing recipe, See you at the latest at Le Mans.
----
CategoryHomepage
For more flexibility you can use {{{sed}}}. The {{{sed}}} command can print e.g. the line number range 1-10:
{{{
    sed -n -e '1,10p' -e '10q'
}}}
Line 12: Line 14:
---- /!\ '''End of edit conflict''' ---- This stops {{{sed}}} from printing each line ({{{-n}}}). Instead it only processes the lines in the range 1-10 ("1,10"), and prints them ("p"). The command will quit after reading line 10 ("10q").

We can now use this to print an arbitrary range of a file (specified by line number):

{{{
# POSIX shell
file=/etc/passwd
range=10
cur=1
last=$(wc -l < "$file") # count number of lines
chunk=1
while [ $cur -lt $last ]
do
    endofchunk=$(($cur + $range - 1))
    sed -n -e "$cur,${endofchunk}p" -e "${endofchunk}q" "$file" > chunk.$(printf %04d $chunk)
    chunk=$(($chunk + 1))
    cur=$(($cur + $range))
done
}}}

The previous example uses POSIX [:ArithmeticExpression:arithmetic], which older [:BourneShell:Bourne shells] do not have. In that case the following example should be used instead:

{{{
# legacy Bourne shell; assume no printf either
file=/etc/passwd
range=10
cur=1
last=`wc -l < "$file"` # count number of lines
chunk=1
while test $cur -lt $last
do
    endofchunk=`expr $cur + $range - 1`
    sed -n -e "$cur,${endofchunk}p" -e "${endofchunk}q" "$file" > chunk.$chunk
    chunk=`expr $chunk + 1`
    cur=`expr $cur + $range`
done
}}}

Awk can also be used to produce a more or less equivalent result:

{{{
 awk -v range=10 '{print > FILENAME "." (int((NR -1)/ range)+1)}' file
}}}

Anchor(faq19)

How can I split a file into line ranges, e.g. lines 1-10, 11-20, 21-30?

Some Unix systems provide the split utility for this purpose:

    split --lines 10 --numeric-suffixes input.txt output-

For more flexibility you can use sed. The sed command can print e.g. the line number range 1-10:

    sed -n -e '1,10p' -e '10q'

This stops sed from printing each line (-n). Instead it only processes the lines in the range 1-10 ("1,10"), and prints them ("p"). The command will quit after reading line 10 ("10q").

We can now use this to print an arbitrary range of a file (specified by line number):

# POSIX shell
file=/etc/passwd
range=10
cur=1
last=$(wc -l < "$file") # count number of lines
chunk=1
while [ $cur -lt $last ]
do
    endofchunk=$(($cur + $range - 1))
    sed -n -e "$cur,${endofchunk}p" -e "${endofchunk}q" "$file" > chunk.$(printf %04d $chunk)
    chunk=$(($chunk + 1))
    cur=$(($cur + $range))
done

The previous example uses POSIX [:ArithmeticExpression:arithmetic], which older [:BourneShell:Bourne shells] do not have. In that case the following example should be used instead:

# legacy Bourne shell; assume no printf either
file=/etc/passwd
range=10
cur=1
last=`wc -l < "$file"` # count number of lines
chunk=1
while test $cur -lt $last
do
    endofchunk=`expr $cur + $range - 1`
    sed -n -e "$cur,${endofchunk}p" -e "${endofchunk}q" "$file" > chunk.$chunk
    chunk=`expr $chunk + 1`
    cur=`expr $cur + $range`
done

Awk can also be used to produce a more or less equivalent result:

 awk -v range=10 '{print > FILENAME "." (int((NR -1)/ range)+1)}' file

BashFAQ/019 (last edited 2022-04-19 12:13:19 by emanuele6)