Mercurial > forge
changeset 11817:8e4b2cf424b2 octave-forge
bioinfo: moved to a separate mercurial repo
author | carandraug |
---|---|
date | Mon, 17 Jun 2013 16:49:15 +0000 |
parents | ef0edb527738 |
children | e9cefd6a4c96 |
files | main/bioinfo/COPYING main/bioinfo/DESCRIPTION main/bioinfo/inst/aa2int.m main/bioinfo/inst/aminolookup.m main/bioinfo/inst/cleave.m main/bioinfo/inst/int2aa.m main/bioinfo/inst/seqreverse.m |
diffstat | 7 files changed, 0 insertions(+), 795 deletions(-) [+] |
line wrap: on
line diff
--- a/main/bioinfo/COPYING Mon Jun 17 16:08:09 2013 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,339 +0,0 @@ - GNU GENERAL PUBLIC LICENSE - Version 2, June 1991 - - Copyright (C) 1989, 1991 Free Software Foundation, Inc. - 675 Mass Ave, Cambridge, MA 02139, USA - Everyone is permitted to copy and distribute verbatim copies - of this license document, but changing it is not allowed. - - Preamble - - The licenses for most software are designed to take away your -freedom to share and change it. By contrast, the GNU General Public -License is intended to guarantee your freedom to share and change free -software--to make sure the software is free for all its users. This -General Public License applies to most of the Free Software -Foundation's software and to any other program whose authors commit to -using it. (Some other Free Software Foundation software is covered by -the GNU Library General Public License instead.) You can apply it to -your programs, too. - - When we speak of free software, we are referring to freedom, not -price. Our General Public Licenses are designed to make sure that you -have the freedom to distribute copies of free software (and charge for -this service if you wish), that you receive source code or can get it -if you want it, that you can change the software or use pieces of it -in new free programs; and that you know you can do these things. - - To protect your rights, we need to make restrictions that forbid -anyone to deny you these rights or to ask you to surrender the rights. -These restrictions translate to certain responsibilities for you if you -distribute copies of the software, or if you modify it. - - For example, if you distribute copies of such a program, whether -gratis or for a fee, you must give the recipients all the rights that -you have. You must make sure that they, too, receive or can get the -source code. And you must show them these terms so they know their -rights. - - We protect your rights with two steps: (1) copyright the software, and -(2) offer you this license which gives you legal permission to copy, -distribute and/or modify the software. - - Also, for each author's protection and ours, we want to make certain -that everyone understands that there is no warranty for this free -software. If the software is modified by someone else and passed on, we -want its recipients to know that what they have is not the original, so -that any problems introduced by others will not reflect on the original -authors' reputations. - - Finally, any free program is threatened constantly by software -patents. We wish to avoid the danger that redistributors of a free -program will individually obtain patent licenses, in effect making the -program proprietary. To prevent this, we have made it clear that any -patent must be licensed for everyone's free use or not licensed at all. - - The precise terms and conditions for copying, distribution and -modification follow. - - GNU GENERAL PUBLIC LICENSE - TERMS AND CONDITIONS FOR COPYING, DISTRIBUTION AND MODIFICATION - - 0. This License applies to any program or other work which contains -a notice placed by the copyright holder saying it may be distributed -under the terms of this General Public License. The "Program", below, -refers to any such program or work, and a "work based on the Program" -means either the Program or any derivative work under copyright law: -that is to say, a work containing the Program or a portion of it, -either verbatim or with modifications and/or translated into another -language. (Hereinafter, translation is included without limitation in -the term "modification".) Each licensee is addressed as "you". - -Activities other than copying, distribution and modification are not -covered by this License; they are outside its scope. The act of -running the Program is not restricted, and the output from the Program -is covered only if its contents constitute a work based on the -Program (independent of having been made by running the Program). -Whether that is true depends on what the Program does. - - 1. You may copy and distribute verbatim copies of the Program's -source code as you receive it, in any medium, provided that you -conspicuously and appropriately publish on each copy an appropriate -copyright notice and disclaimer of warranty; keep intact all the -notices that refer to this License and to the absence of any warranty; -and give any other recipients of the Program a copy of this License -along with the Program. - -You may charge a fee for the physical act of transferring a copy, and -you may at your option offer warranty protection in exchange for a fee. - - 2. You may modify your copy or copies of the Program or any portion -of it, thus forming a work based on the Program, and copy and -distribute such modifications or work under the terms of Section 1 -above, provided that you also meet all of these conditions: - - a) You must cause the modified files to carry prominent notices - stating that you changed the files and the date of any change. - - b) You must cause any work that you distribute or publish, that in - whole or in part contains or is derived from the Program or any - part thereof, to be licensed as a whole at no charge to all third - parties under the terms of this License. - - c) If the modified program normally reads commands interactively - when run, you must cause it, when started running for such - interactive use in the most ordinary way, to print or display an - announcement including an appropriate copyright notice and a - notice that there is no warranty (or else, saying that you provide - a warranty) and that users may redistribute the program under - these conditions, and telling the user how to view a copy of this - License. (Exception: if the Program itself is interactive but - does not normally print such an announcement, your work based on - the Program is not required to print an announcement.) - -These requirements apply to the modified work as a whole. If -identifiable sections of that work are not derived from the Program, -and can be reasonably considered independent and separate works in -themselves, then this License, and its terms, do not apply to those -sections when you distribute them as separate works. But when you -distribute the same sections as part of a whole which is a work based -on the Program, the distribution of the whole must be on the terms of -this License, whose permissions for other licensees extend to the -entire whole, and thus to each and every part regardless of who wrote it. - -Thus, it is not the intent of this section to claim rights or contest -your rights to work written entirely by you; rather, the intent is to -exercise the right to control the distribution of derivative or -collective works based on the Program. - -In addition, mere aggregation of another work not based on the Program -with the Program (or with a work based on the Program) on a volume of -a storage or distribution medium does not bring the other work under -the scope of this License. - - 3. You may copy and distribute the Program (or a work based on it, -under Section 2) in object code or executable form under the terms of -Sections 1 and 2 above provided that you also do one of the following: - - a) Accompany it with the complete corresponding machine-readable - source code, which must be distributed under the terms of Sections - 1 and 2 above on a medium customarily used for software interchange; or, - - b) Accompany it with a written offer, valid for at least three - years, to give any third party, for a charge no more than your - cost of physically performing source distribution, a complete - machine-readable copy of the corresponding source code, to be - distributed under the terms of Sections 1 and 2 above on a medium - customarily used for software interchange; or, - - c) Accompany it with the information you received as to the offer - to distribute corresponding source code. (This alternative is - allowed only for noncommercial distribution and only if you - received the program in object code or executable form with such - an offer, in accord with Subsection b above.) - -The source code for a work means the preferred form of the work for -making modifications to it. For an executable work, complete source -code means all the source code for all modules it contains, plus any -associated interface definition files, plus the scripts used to -control compilation and installation of the executable. However, as a -special exception, the source code distributed need not include -anything that is normally distributed (in either source or binary -form) with the major components (compiler, kernel, and so on) of the -operating system on which the executable runs, unless that component -itself accompanies the executable. - -If distribution of executable or object code is made by offering -access to copy from a designated place, then offering equivalent -access to copy the source code from the same place counts as -distribution of the source code, even though third parties are not -compelled to copy the source along with the object code. - - 4. You may not copy, modify, sublicense, or distribute the Program -except as expressly provided under this License. Any attempt -otherwise to copy, modify, sublicense or distribute the Program is -void, and will automatically terminate your rights under this License. -However, parties who have received copies, or rights, from you under -this License will not have their licenses terminated so long as such -parties remain in full compliance. - - 5. You are not required to accept this License, since you have not -signed it. However, nothing else grants you permission to modify or -distribute the Program or its derivative works. These actions are -prohibited by law if you do not accept this License. Therefore, by -modifying or distributing the Program (or any work based on the -Program), you indicate your acceptance of this License to do so, and -all its terms and conditions for copying, distributing or modifying -the Program or works based on it. - - 6. Each time you redistribute the Program (or any work based on the -Program), the recipient automatically receives a license from the -original licensor to copy, distribute or modify the Program subject to -these terms and conditions. You may not impose any further -restrictions on the recipients' exercise of the rights granted herein. -You are not responsible for enforcing compliance by third parties to -this License. - - 7. If, as a consequence of a court judgment or allegation of patent -infringement or for any other reason (not limited to patent issues), -conditions are imposed on you (whether by court order, agreement or -otherwise) that contradict the conditions of this License, they do not -excuse you from the conditions of this License. If you cannot -distribute so as to satisfy simultaneously your obligations under this -License and any other pertinent obligations, then as a consequence you -may not distribute the Program at all. For example, if a patent -license would not permit royalty-free redistribution of the Program by -all those who receive copies directly or indirectly through you, then -the only way you could satisfy both it and this License would be to -refrain entirely from distribution of the Program. - -If any portion of this section is held invalid or unenforceable under -any particular circumstance, the balance of the section is intended to -apply and the section as a whole is intended to apply in other -circumstances. - -It is not the purpose of this section to induce you to infringe any -patents or other property right claims or to contest validity of any -such claims; this section has the sole purpose of protecting the -integrity of the free software distribution system, which is -implemented by public license practices. Many people have made -generous contributions to the wide range of software distributed -through that system in reliance on consistent application of that -system; it is up to the author/donor to decide if he or she is willing -to distribute software through any other system and a licensee cannot -impose that choice. - -This section is intended to make thoroughly clear what is believed to -be a consequence of the rest of this License. - - 8. If the distribution and/or use of the Program is restricted in -certain countries either by patents or by copyrighted interfaces, the -original copyright holder who places the Program under this License -may add an explicit geographical distribution limitation excluding -those countries, so that distribution is permitted only in or among -countries not thus excluded. In such case, this License incorporates -the limitation as if written in the body of this License. - - 9. The Free Software Foundation may publish revised and/or new versions -of the General Public License from time to time. Such new versions will -be similar in spirit to the present version, but may differ in detail to -address new problems or concerns. - -Each version is given a distinguishing version number. If the Program -specifies a version number of this License which applies to it and "any -later version", you have the option of following the terms and conditions -either of that version or of any later version published by the Free -Software Foundation. If the Program does not specify a version number of -this License, you may choose any version ever published by the Free Software -Foundation. - - 10. If you wish to incorporate parts of the Program into other free -programs whose distribution conditions are different, write to the author -to ask for permission. For software which is copyrighted by the Free -Software Foundation, write to the Free Software Foundation; we sometimes -make exceptions for this. Our decision will be guided by the two goals -of preserving the free status of all derivatives of our free software and -of promoting the sharing and reuse of software generally. - - NO WARRANTY - - 11. BECAUSE THE PROGRAM IS LICENSED FREE OF CHARGE, THERE IS NO WARRANTY -FOR THE PROGRAM, TO THE EXTENT PERMITTED BY APPLICABLE LAW. EXCEPT WHEN -OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR OTHER PARTIES -PROVIDE THE PROGRAM "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED -OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF -MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE ENTIRE RISK AS -TO THE QUALITY AND PERFORMANCE OF THE PROGRAM IS WITH YOU. SHOULD THE -PROGRAM PROVE DEFECTIVE, YOU ASSUME THE COST OF ALL NECESSARY SERVICING, -REPAIR OR CORRECTION. - - 12. IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING -WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MAY MODIFY AND/OR -REDISTRIBUTE THE PROGRAM AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES, -INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING -OUT OF THE USE OR INABILITY TO USE THE PROGRAM (INCLUDING BUT NOT LIMITED -TO LOSS OF DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY -YOU OR THIRD PARTIES OR A FAILURE OF THE PROGRAM TO OPERATE WITH ANY OTHER -PROGRAMS), EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE -POSSIBILITY OF SUCH DAMAGES. - - END OF TERMS AND CONDITIONS - - How to Apply These Terms to Your New Programs - - If you develop a new program, and you want it to be of the greatest -possible use to the public, the best way to achieve this is to make it -free software which everyone can redistribute and change under these terms. - - To do so, attach the following notices to the program. It is safest -to attach them to the start of each source file to most effectively -convey the exclusion of warranty; and each file should have at least -the "copyright" line and a pointer to where the full notice is found. - - <one line to give the program's name and a brief idea of what it does.> - Copyright (C) 19yy <name of author> - - This program is free software; you can redistribute it and/or modify - it under the terms of the GNU General Public License as published by - the Free Software Foundation; either version 2 of the License, or - (at your option) any later version. - - This program is distributed in the hope that it will be useful, - but WITHOUT ANY WARRANTY; without even the implied warranty of - MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the - GNU General Public License for more details. - - You should have received a copy of the GNU General Public License - along with this program; if not, write to the Free Software - Foundation, Inc., 675 Mass Ave, Cambridge, MA 02139, USA. - -Also add information on how to contact you by electronic and paper mail. - -If the program is interactive, make it output a short notice like this -when it starts in an interactive mode: - - Gnomovision version 69, Copyright (C) 19yy name of author - Gnomovision comes with ABSOLUTELY NO WARRANTY; for details type `show w'. - This is free software, and you are welcome to redistribute it - under certain conditions; type `show c' for details. - -The hypothetical commands `show w' and `show c' should show the appropriate -parts of the General Public License. Of course, the commands you use may -be called something other than `show w' and `show c'; they could even be -mouse-clicks or menu items--whatever suits your program. - -You should also get your employer (if you work as a programmer) or your -school, if any, to sign a "copyright disclaimer" for the program, if -necessary. Here is a sample; alter the names: - - Yoyodyne, Inc., hereby disclaims all copyright interest in the program - `Gnomovision' (which makes passes at compilers) written by James Hacker. - - <signature of Ty Coon>, 1 April 1989 - Ty Coon, President of Vice - -This General Public License does not permit incorporating your program into -proprietary programs. If your program is a subroutine library, you may -consider it more useful to permit linking proprietary applications with the -library. If this is what you want to do, use the GNU Library General -Public License instead of this License.
--- a/main/bioinfo/DESCRIPTION Mon Jun 17 16:08:09 2013 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,12 +0,0 @@ -Name: bioinfo -Version: 0.1.2 -Date: 2009-05-06 -Author: Bill Denney <bill@denney.ws> -Maintainer: Octave Community -Title: Bioinformatics -Description: Bioinformatics manipulation -Categories: Bioinformatics -Depends: octave (>= 3.0.0) -Autoload: yes -License: GPL version 2 or later -Url: http://octave.sf.net
--- a/main/bioinfo/inst/aa2int.m Mon Jun 17 16:08:09 2013 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,49 +0,0 @@ -## Copyright (C) 2008 Bill Denney -## -## This software is free software; you can redistribute it and/or modify it -## under the terms of the GNU General Public License as published by -## the Free Software Foundation; either version 3 of the License, or (at -## your option) any later version. -## -## This software is distributed in the hope that it will be useful, but -## WITHOUT ANY WARRANTY; without even the implied warranty of -## MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU -## General Public License for more details. -## -## You should have received a copy of the GNU General Public License -## along with this software; see the file COPYING. If not, see -## <http://www.gnu.org/licenses/>. - -## -*- texinfo -*- -## @deftypefn {Function File} {@var{sequenceint} =} aa2int (@var{sequencechar}) -## Convert amino acid characters into integers. See @code{aminolookup} -## for the listing of numbers and their corresponding characters. -## @seealso{aminolookup,int2aa,int2nt,nt2int} -## @end deftypefn - -## Author: Bill Denney <bill@denney.ws> - -function seqi = aa2int (seqc=0) - - if (! ischar (seqc)) - print_usage (); - endif - sequences = aminolookup ("code", []); - - seqi = -ones (size (seqc)); - seqc = upper (seqc); - for i = 1:numel (sequences) - seqi(sequences(i) == seqc) = i; - endfor - if any (seqi(:) < 0) - idx = find (seqi(:) < 0, 1); - error ("aa2int: unrecognised character (%s) in sequence input at position (%d)", - seqc(idx), idx) - endif - -endfunction - -## Tests -%!assert (aa2int ("a"), 1) -%!assert (aa2int ("ar"), [1 2]) -%!assert (aa2int (["ar"]'), [1;2])
--- a/main/bioinfo/inst/aminolookup.m Mon Jun 17 16:08:09 2013 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,224 +0,0 @@ -## Copyright (C) 2008 Bill Denney -## -## This software is free software; you can redistribute it and/or modify it -## under the terms of the GNU General Public License as published by -## the Free Software Foundation; either version 3 of the License, or (at -## your option) any later version. -## -## This software is distributed in the hope that it will be useful, but -## WITHOUT ANY WARRANTY; without even the implied warranty of -## MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU -## General Public License for more details. -## -## You should have received a copy of the GNU General Public License -## along with this software; see the file COPYING. If not, see -## <http://www.gnu.org/licenses/>. - -## -*- texinfo -*- -## @deftypefn {Function File} {} aminolookup () -## @deftypefnx {Function File} {@var{aminodesc} =} aminolookup (@var{seq}) -## @deftypefnx {Function File} {@var{aminodesc} =} aminolookup (@var{searchtype}, @var{seq}) -## Convert between amino acid representations. The types of input are -## -## @itemize @bullet -## @item Name -## -## The amino acid name -## @item Code -## -## The amino acid single letter code -## @item Abbreviation -## -## The three letter abbreviation for the amino acid -## @item Integer -## -## The number representation of the amino acid -## @end itemize -## -## To see the full list of each of the above, run this function without -## any arguments or outputs. -## -## If called without zero inputs, this will display the mapping between -## the above types. If called with one input, it will convert to code by default. -## @seealso{aa2int,int2aa,int2nt,nt2int} -## @end deftypefn - -## Author: Bill Denney <bill@denney.ws> - -function result = aminolookup (varargin) - - persistent data - - if isempty (data) - data.code = "ARNDCQEGHILKMFPSTWYVBZX*-"; - data.num = 1:25; - - data.abbr = {"Ala" "Arg" "Asn" "Asp" "Cys" "Gln" "Glu" "Gly" "His" \ - "Ile" "Leu" "Lys" "Met" "Phe" "Pro" "Ser" "Thr" "Trp" \ - "Tyr" "Val" "Asx" "Glx" "Xaa" "END" "GAP"}; - - data.name = cell (numel (data.code), 3); - data.name(:,1) = {"Alanine" "Arginine" "Asparagine" "Aspartic acid" \ - "Cysteine" "Glutamine" "Glutamic acid" "Glycine" \ - "Histidine" "Isoleucine" "Leucine" "Lysine" \ - "Methionine" "Phenylalanine" "Proline" "Serine" \ - "Threonine" "Tryptophan" "Tyrosine" "Valine" \ - "Asparagine" "Glutamine" "Any amino acid" \ - "Termination codon (translation stop)" \ - "Gap of unknown length"}'; - ## Alternate names - data.name{4,2} = "Aspartate"; - data.name{7,2} = "Glutamate"; - data.name{21,2} = "Aspartic acid"; - data.name{22,2} = "Glutamic acid"; - data.name{21,3} = "Aspartate"; - data.name{22,3} = "Glutamate"; - for i = 1:numel (data.name) - if isempty (data.name{i}) - data.name{i} = ""; - endif - endfor - - data.seq = {"GCU GCC GCA GCG" "CGU CGC CGA CGG AGA AGG" "AAU AAC" \ - "GAU GAC" "UGU UGC" "CAA CAG" "GAA GAG" \ - "GGU GGC GGA GGG" "CAU CAC" "AUU AUC AUA" \ - "UUA UUG CUU CUC CUA CUG" "AAA AAG" "AUG" "UUU UUC" \ - "CCU CCC CCA CCG" "UCU UCC UCA UCG AGU AGC" \ - "ACU ACC ACA ACG" "UGG" "UAU UAC" "GUU GUC GUA GUG" \ - "AAU AAC GAU GAC" "CAA CAG GAA GAG" "All codons" \ - "UAA UAG UGA" "NA"}; - endif - - searchtype = ""; - value = ""; - - if (nargin == 0) - n = cell (rows (data.name), 4); - n(:,1) = data.name(:,1); - for i = 1:rows (data.name) - for j = 2:columns (data.name) - if (! isempty (data.name{i,j})) - n{i,1} = sprintf ("%s or %s", n{i,1}, data.name{i,j}); - endif - endfor - n{i,2} = num2str (i); - n{i,3} = data.code(i); - n{i,4} = data.seq{i}; - endfor - s = max (cellfun (@numel, n)); - fmt = sprintf ("%%-%ds ", s(1), s(2), s(3), s(4)); - for i = 1:rows (n) - printf ([fmt "\n"], n{i,:}); - endfor - elseif (nargin == 1) - showmany = 1; - if isnumeric (varargin{1}) - ## this is an extension of the matlab options - searchtype = "integer"; - elseif ischar (varargin{1}) - if ((mod (numel (varargin{1}), 3) == 0) && - (find (isupper (varargin{1})) == 1:3:numel (varargin{1}))) - ## if the number of characters is divisible by 3 and exactly - ## every third character is upper case - searchtype = "abbreviation"; - else - searchtype = "code"; - endif - endif - value = varargin{1}; - else - showmany = 0; - searchtype = lower (varargin{1}); - value = varargin{2}; - endif - - if (rows (value) > 1) - error ("aminolookup: value may only be one row") - endif - - if (showmany == 1) - ## we need to convert many inputs into one output. First convert - ## the input value into the integer form. - switch lower (searchtype) - case "code" - value = upper (value); - newvalue = -ones (size (value)); - for i = 1:numel (data.code) - newvalue(value(:) == data.code(i)) = i; - endfor - outtype = "abbreviation"; - case "abbreviation" - newvalue = -ones (1, numel (value)/3); - for i = 1:3:numel (value) - newvalue((i-1)/3+1) = find (strcmp (value(i:i+2), data.abbr), 1); - endfor - outtype = "code"; - case "integer" - newvalue = value; - outtype = "code"; - otherwise - error (["aninolookup: cannot convert multiple arguments of any type\n" - "but code, abbreviation, or integer"]); - endswitch - if (any (newvalue(:)) < 0) - idx = find ((newvalue < 0) | (newvalue > numel (data.num)), 1); - error ("aminolookup: unrecognised symbol in input at position %d", idx); - endif - - switch outtype - case "code" - result = data.code(newvalue); - case "abbreviation" - result = strcat(data.abbr{newvalue}); - otherwise - error ("aminolookup: invalid output type") - endswitch - elseif (showmany == 0) - ## we're only showing one value - switch lower (searchtype) - case "integer" - if isempty (value) - result = ints; - else - result = sprintf ("%s %s %s", data.code(value), data.abbr(value), data.name(value)); - endif - case "code" - if isempty (value) - result = data.code; - else - idx = find (lower (value) == lower(data.code), 1); - result = sprintf ("%s %s", data.abbr{idx}, data.name{idx}); - endif - case "abbreviation" - if isempty (value) - result = data.abbr; - else - idx = find (strcmpi (value, data.abbr)); - result = sprintf ("%s %s", data.code(idx), data.name{idx}); - endif - case "name" - if isempty (value) - result = data.name; - else - [idx whocares] = find (strcmpi (value, data.name), 1); - result = sprintf ("%s %s", data.code(idx), data.abbr{idx}); - endif - otherwise - error ("aminolookup: invalid search type, %s", searchtype) - endswitch - endif - -endfunction - -## Tests -%!shared code, abbr, ints -%! code = "MWKQAEDIRDIYDF"; -%! abbr = "MetTrpLysGlnAlaGluAspIleArgAspIleTyrAspPhe"; -%! ints = [13 18 12 6 1 7 4 10 2 4 10 19 4 14]; -%!assert (aminolookup(code), abbr) -%!assert (aminolookup(abbr), code) -%!assert (aminolookup(ints), code) -%!assert (aminolookup("Code", "R"), "Arg Arginine") -%!assert (aminolookup("Integer", 1), "A Ala Alanine") -%!assert (aminolookup("Abbreviation", "asn"), "N Asparagine") -%!assert (aminolookup("Name", "proline"), "P Pro")
--- a/main/bioinfo/inst/cleave.m Mon Jun 17 16:08:09 2013 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,86 +0,0 @@ -## Copyright (C) 2008 Bill Denney -## -## This software is free software; you can redistribute it and/or modify it -## under the terms of the GNU General Public License as published by -## the Free Software Foundation; either version 3 of the License, or (at -## your option) any later version. -## -## This software is distributed in the hope that it will be useful, but -## WITHOUT ANY WARRANTY; without even the implied warranty of -## MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU -## General Public License for more details. -## -## You should have received a copy of the GNU General Public License -## along with this software; see the file COPYING. If not, see -## <http://www.gnu.org/licenses/>. - -## -*- texinfo -*- -## @deftypefn {Function File} {@var{fragments} =} cleave (@var{sequence}, @var{pattern}, @var{position}) -## @deftypefnx {Function File} {[@var{fragments}, @var{cuttingsites}] =} cleave (@dots{}) -## @deftypefnx {Function File} {[@var{fragments}, @var{cuttingsites}, @var{lengths}] =} cleave (@dots{}) -## @deftypefnx {Function File} {[@dots{}] =} cleave (@dots{}, "PartialDigest", @var{prob}) -## Cleave a peptide @var{sequence} using the @var{pattern} at the -## @var{position} relative to the pattern. The @var{sequence} is a -## sequence of amino acids; the @var{pattern} is a regular expression to -## find the location of the cleavage; and the @var{position} is the -## position relative to that regular expression (0 is immediately to the -## left of the first character, 1 is immediately to the right of the -## first character, @dots{}). -## -## Example regular expressions for some commone proteases are given -## below. -## -## @multitable @columnfractions .5 .3 .2 -## @item Protease @tab Peptide Pattern @tab Position -## @item Trypsin @tab [KR](?!P) @tab 1 -## @item Chymotrypsin @tab [WYF](?!P) @tab 1 -## @item Glutamine C @tab [ED](?!P) @tab 1 -## @item Lysine C @tab [K](?!P) @tab 1 -## @item Aspartic acid N @tab D @tab 1 -## @end multitable -## -## @seealso{rebasecuts, restrict, seqshowwords, regexp} -## @end deftypefn - -## Author: Bill Denney <bill@denney.ws> - -function [fragments, s, lengths] = cleave (seq, pattern, pos, varargin) - - if isstruct (seq) - seq = seq.sequence; - endif - if (rows (seq) != 1) - error ("seqreverse: the sequence must have a single row"); - endif - if (! isempty (varargin)) - ## FIXME: add support for partialdigest - error ("cleave: additional parameters are not yet supported") - endif - - s = [0 (pos - 1 + regexp (seq, pattern))]; - ## There is no cleavage at the end-- there was already a cleavage - ## there. - if (s(end) == length(seq)) - s(end) = []; - endif - fragments = cell (size (s)); - lengths = [s(2:end) length(seq)] - s; - for i = 1:numel (s) - fragments{i} = seq(s(i)+1:s(i)+lengths(i)); - endfor - -endfunction - -## Tests -%!test -%! site = [0 6 7 41 46 67 74 80 92 100]; -%! len = [6 1 34 5 21 7 6 12 8 10]; -%! frag = {"MGTGGR" "R" "GAAAAPLLVAVAALLLGAAGHLYPGEVCPGMDIR" "NNLTR" \ -%! "LHELENCSVIEGHLQILLMFK" "TRPEDFR" "DLSFPK" "LIMITDYLLLFR" \ -%! "VYGLESLK" "DLFPNLTVIR"}; -%! seq = strcat (frag{:}); -%! [tfrag, tsite, tlen] = cleave (seq, '[KR](?!P)', 1); -%! assert (tfrag, frag) -%! assert (tsite, site) -%! assert (tlen, len) -## FIXME: This needs a test for partialdigest.
--- a/main/bioinfo/inst/int2aa.m Mon Jun 17 16:08:09 2013 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,38 +0,0 @@ -## Copyright (C) 2008 Bill Denney -## -## This software is free software; you can redistribute it and/or modify it -## under the terms of the GNU General Public License as published by -## the Free Software Foundation; either version 3 of the License, or (at -## your option) any later version. -## -## This software is distributed in the hope that it will be useful, but -## WITHOUT ANY WARRANTY; without even the implied warranty of -## MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU -## General Public License for more details. -## -## You should have received a copy of the GNU General Public License -## along with this software; see the file COPYING. If not, see -## <http://www.gnu.org/licenses/>. - -## -*- texinfo -*- -## @deftypefn {Function File} {@var{sequencechar} =} int2aa (@var{sequenceint}) -## Convert amino acid integers into characters. See @code{aminolookup} -## for the listing of numbers and their corresponding characters. -## @seealso{aminolookup,aa2int,int2nt,nt2int} -## @end deftypefn - -## Author: Bill Denney <bill@denney.ws> - -function seqc = int2aa (seqi) - - if (! isnumeric (seqi)) - print_usage (); - endif - sequences = aminolookup ("code", []); - seqc = reshape (sequences(seqi), size (seqi)); -endfunction - -## Tests -%!assert (int2aa (1), "A") -%!assert (int2aa ([1 2]), "AR"); -%!assert (int2aa ([1;2]), ["A";"R"])
--- a/main/bioinfo/inst/seqreverse.m Mon Jun 17 16:08:09 2013 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,47 +0,0 @@ -## Copyright (C) 2008 Bill Denney -## -## This software is free software; you can redistribute it and/or modify it -## under the terms of the GNU General Public License as published by -## the Free Software Foundation; either version 3 of the License, or (at -## your option) any later version. -## -## This software is distributed in the hope that it will be useful, but -## WITHOUT ANY WARRANTY; without even the implied warranty of -## MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU -## General Public License for more details. -## -## You should have received a copy of the GNU General Public License -## along with this software; see the file COPYING. If not, see -## <http://www.gnu.org/licenses/>. - -## -*- texinfo -*- -## @deftypefn {Function File} {@var{seqr} =} seqreverse (@var{seqf}) -## Reverse a nucleotide sequence. @var{seqf} can be either a numeric or -## code DNA or RNA sequence or a struct with the field sequence, and it -## will reverse the direction of the sequence. -## @seealso{seqcomplement, seqrcomplement} -## @end deftypefn - -## Author: Bill Denney <bill@denney.ws> - -function s = seqreverse (seqf) - - if isstruct (seqf) - s = seqf.sequence; - else - s = seqf; - endif - - if (rows (s) != 1) - error ("seqreverse: the sequence must have a single row"); - endif - s = fliplr (s); - if isstruct (seqf) - seqf.sequence = s; - s = seqf; - endif -endfunction - -## Tests -%!assert (seqreverse ("ATCG"), "GCTA") -%!assert (seqreverse (struct ("sequence", "ATCG")), struct ("sequence", "GCTA"));